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THE HIGH BONE MASS GENE OF llql3.3 

FIELD OF THE INVENTION 

The present invention relates generally to the field of genetics, genomics and 
molecular biology. More particularly, the invention relates to methods and materials 
5 used to isolate, detect and sequence a high bone mass gene and corresponding wild- 
type gene, and mutants thereof. The present invention also relates to the high bone 
mass gene, the corresponding wild-type gene, and mutants thereof. The genes 
identified in the present invention are implicated in the ontology and physiology of 
bone development. The invention also provides nucleic acids, proteins, cloning 

1 0 vectors, expression vectors, transformed hosts, methods of developing 

pharmaceutical compositions, methods of identifying molecules involved in bone 
development, and methods of diagnosing and treating diseases involved in bone 
development. In preferred embodiments, the present invention is directed to 
methods for treating, diagnosing, preventing and screening for normal and abnormal 

15 conditions of bone, including metabolic bone diseases such as osteoporosis. 

BACKGROUND OF THE INVENTION 

Two of the most common types of osteoporosis are postmenopausal and 
senile osteoporosis. Osteoporosis affects men as well as women, and, taken with 
other abnormalities of bone, presents an ever-increasing health risk for an aging 

20 population. The most common type of osteoporosis is that associated with 

menopause. Most women lose between 20-60% of the bone mass in the trabecular 
compartment of the bone within 3-6 years after the cessation of menses. This rapid 
loss is generally associated with an increase of bone resorption and formation. 
However, the resorptive cycle is more dominant and the result is a net loss of bone 

25 mass. Osteoporosis is a common and serious disease among postmenopausal 

women. There are an estimated 25 million women in the United States alone who 
are afflicted with this disease. The results of osteoporosis are both personally 
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harmful, and also account for a large economic loss due to its chronicity and the 
need for extensive and long-term support (hospitalization and nursing home care) 
from the disease sequelae. This is especially true in more elderly patients. 
Additionally, while osteoporosis is generally not thought of as a life-threatening 
5 condition, a 20-30% mortality rate is related to hip fractures in elderly women. A 
large percentage of this mortality rate can be directly associated with 
postmenopausal osteoporosis. 

The most vulnerable tissue in the bone to the effects of postmenopausal 
osteoporosis is the trabecular bone. This tissue is often referred to as spongy bone 

10 and is particularly concentrated near the ends of the bone near the joints and in the 
vertebrae of the spine. The trabecular tissue is characterized by small structures 
which inter-connect with each other as well as the more solid and dense cortical 
tissue which makes up the outer surface and central shaft of the bone. This criss- 
cross network of trabeculae gives lateral support to the outer cortical structure and is 

15 critical to the biomechanical strength of the overall structure. In postmenopausal 
osteoporosis, it is primarily the net resorption and loss of the trabeculae which lead 
to the failure and fracture of the bone. In light of the loss of the trabeculae in 
postmenopausal women, it is not surprising that the most common fractures are 
those associated with bones which are highly dependent on trabecular support, e.g., 

20 the vertebrae, the neck of the femur, and the forearm. Indeed, hip fracture, Colle f s 
fractures, and vertebral crush fractures are indicative of postmenopausal 
osteoporosis. 

One of the earliest generally accepted methods for treatment of 
postmenopausal osteoporosis was estrogen replacement therapy. Although this 
25 therapy frequently is successful, patient compliance is low, primarily due to the 

undesirable side-effects of chronic estrogen treatment. Frequently cited side-effects 
of estrogen replacement therapy include reinitiation of menses, bloating, depression, 
and fear of breast or uterine cancer. In order to limit the known threat of uterine 
cancer in those women who have not undergone a hysterectomy, a protocol of 
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estrogen and progestin cyclic therapy is often employed. This protocol is similar to 
that which is used in birth control regimens, and often is not tolerated by women 
because of the side-effects characteristic of progestin. More recently, certain 
antiestrogens, originally developed for the treatment of breast cancer, have been 
5 shown in experimental models of postmenopausal osteoporosis to be efficacious. 

Among these agents is raloxifene (See, U.S. Patent No. 5,393,763, and Black et al, J. 
Clin. Invest, 93:63-69 (1994)). In addition, tamoxifene, a widely used clinical agent 
for the treatment of breast cancer, has been shown to increase bone mineral density 
in post menopausal women suffering from breast cancer (Love et al, N. Engl J. 

10 Med., 326:852-856 (1992)). 

Another therapy for the treatment of postmenopausal osteoporosis is the use 
of calcitonin. Calcitonin is a naturally occurring peptide which inhibits bone 
resorption and has been approved for this use in many countries (Overgaard et al, Br. 
Med. J., 305:556-561 (1992)). The use of calcitonin has been somewhat limited, 

15 however. Its effects are very modest in increasing bone mineral density and the 

treatment is very expensive. Another therapy for the treatment of postmenopausal, 
osteoporosis is the use of bis-phosphonates. These compounds were originally 
developed for use in Paget's disease and malignant hypercalcemia. They have been 
shown to inhibit bone resorption. Alendronate, one compound of this class, has 

20 been approved for the treatment of postmenopausal osteoporosis. These agents may 
be helpful in the treatment of osteoporosis, but these agents also have potential 
liabilities which include osteomalacia, extremely long half-life in bone (greater than 
2 years), and possible "frozen bone syndrome," e.g., the cessation of normal bone 
remodeling. 

25 Senile osteoporosis is similar to postmenopausal osteoporosis in that it is 

marked by the loss of bone mineral density and resulting increase in fracture rate, 
morbidity, and associated mortality. Generally, it occurs in later life, i.e., after 70 
years of age. Historically, senile osteoporosis has been more common in females, 
but with the advent of a more elderly male population, this disease is becoming a 
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major factor in the health of both sexes. It is not clear what, if any, role hormones 
such as testosterone or estrogen have in this disease, and its etiology remains 
obscure. Treatment of this disease has not been very satisfactoiy. Hormone therapy, 
estrogen in women and testosterone in men, has shown equivocal results; calcitonin 



The peak mass of the skeleton at maturity is largely under genetic control. 
Twin studies have shown that the variance in bone mass between adult monozygotic 
twins is smaller than between dizygotic twins (Slemenda et al, J. Bone Miner. Res., 
6:561-567 (1991); Young et al, J. Bone Miner. Res., 6:561-567 (1995); Pocock et al, 

10 J. Clin. Invest., 80:706-710 (1987); Kelly et al,J. BoneMiner. Res., 8:11-17 (1993)), 
and it has been estimated that up to 60% or more of the variance in skeletal mass is 
inherited (Krall et al, J. Bone Miner. Res., 10:S367 (1993)). Peak skeletal mass is 
the most powerful determinant of bone mass in elderly years (Hui et al, Ann. Int. 
Med., 1 1 1:355-361 (1989)), even though the rate of age-related bone loss in adult 

1 5 and later life is also a strong determinant (Hui et al, Osteoporosis Int., 1 :30-34 

(1995)). Since bone mass is the principal measurable determinant of fracture risk, 
the inherited peak skeletal mass achieved at maturity is an important determinant of 
an individual's risk of fracture later in life. Thus, study of the genetic basis of bone 
mass is of considerable interest in the etiology of fractures due to osteoporosis. 

20 Recently, a strong interest in the genetic control of peak bone mass has 

developed in the field of osteoporosis. The interest has focused mainly on candidate 
genes with suitable polymorphisms to test for association with variation in bone 
mass within the normal range, or has focused on examination of genes and gene loci 
associated with low bone mass in the range found in patients with osteoporosis. The 

25 vitamin D receptor locus (VDR) (Morrison et al, Nature, 367:284-287 (1994)), PTH 
gene (Howard et al, J. Clin. Endocrinol Metab., 80:2800-2805 (1995); Johnson et 
al, J. Bone Miner. Res. ,8:11-17 (1 995); Gong et al, J. Bone Miner. Res. , 1 0:S462 
(1995)) and the estrogen receptor gerie (Hosoi et al, J. Bone Miner. Res., 10:S170 
(1995); Morrison et al, Nature, 367:284-287 (1994)) have figured most prominently 



5 



and bis-phosphonates may be of some utility. 
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in this work. These studies are difficult because bone mass (the phenotype) is a 
continuous, quantitative, polygenic trait, and is confounded by environmental factors 
such as nutrition, co-morbid disease, age, physical activity, and other factors. Also, 
this type of study design requires large numbers of subjects. In particular, the results 
5 of VDR studies to date have been confusing and contradictory (Garnero et al, J. 
Bone Miner. Res., 10:1283-1288 (1995); Eisman et al, J. Bone. Miner. Res., 
10:1289-1293 (1995); Peacock, J. Bone Miner. Res., 10:1294-1297 (1995)). 
Furthermore, the work thus far has not shed much light on the mechanism(s) 
whereby the genetic influences might exert their effect on bone mass. 

10 While it is well known that peak bone mass is largely determined by genetic 

rather than environmental factors, studies to determine the gene loci (and ultimately 
the genes) linked to variation in bone mass are difficult and expensive. Study 
designs which utilize the power of linkage analysis, e.g., sib-pair or extended family, 
are generally more informative than simple association studies, although the latter 

15 do have value. However, genetic linkage studies involving bone mass are hampered 
by two major problems. The first problem is the phenotype, as discussed briefly 
above. Bone mass is a continuous, quantitative trait, and establishing a discrete 
phenotype is difficult. Each anatomical site for measurement may be influenced by 
several genes, many of which may be different from site to site. The second 

20 problem is the age component of the phenotype. By the time an individual can be 
identified as having low bone mass, there is a high probability that their parents or 
other members of prior generations will be deceased and therefore unavailable for 
study, and younger generations may not have even reached peak bone mass, making 
their phenotyping uncertain for genetic analysis. 

25 Regardless, linkage analysis can be used to find the location of a gene 

causing a hereditary "disorder" and does not require any knowledge of the 
biochemical nature of the disorder, i.e., a mutated protein that is believed to cause 
the disorder does not need to be known. Traditional approaches depend on 
assumptions concerning the disease process that might implicate a known protein as 
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a candidate to be evaluated. The genetic localization approach using linkage 
analysis can be used to first find the general chromosomal region in which the 
defective gene is located and then to gradually reduce the size of the region in order 
to determine the location of the specific mutated gene as precisely as possible. After 
5 the gene itself is discovered within the candidate region, the messenger RNA and the 
protein are identified and, along with the DNA, are checked for mutations. 

The genetic localization approach has practical implications since the 
location of the disease can be used for prenatal diagnosis even before the altered 
gene that causes the disease is found. Linkage analysis can enable families, even 
10 many of those that do not have a sick child, to know whether they are carriers of a 
disease gene and to evaluate the condition of an unborn child through molecular 
diagnosis. The transmission of a disease within families, then, can be used to find 
the defective gene. As used herein, reference to "high bone mass" (HBM) is 
analogous to reference to a disease state, although from a practical standpoint high 
15 bone mass can actually help a subject avoid the disease known as osteoporosis. 

Linkage analysis is possible because of the nature of inheritance of 
chromosomes from parents to offspring. During meiosis, the two parental 
homologues pair to guide their proper separation to daughter cells. While they are 
lined up and paired, the two homologues exchange pieces of the chromosomes, in an 
20 event called "crossing over" or "recombination." The resulting chromosomes are 
chimeric, that is, they contain parts that originate from both parental homologues. 
The closer together two sequences are on the chromosome, the less likely that a 
recombination event will occur between them, and the more closely linked they are. 
In a linkage analysis experiment, two positions on the chromosomes are followed 
25 from one generation to the next to determine the frequency of recombination 

between them. In a study of an inherited disease, one of the chromosomal positions 
is marked by the disease gene or its normal counterpart, i.e., the inheritance of the 
chromosomal region can be determined by examining whether the individual 
displays symptoms of the disorder or not. The other position is marked by a DNA 
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sequence that shows natural variation in the population such that the two 
homologues can be distinguished based on the copy of the "marker" sequence that 
they possess. In every family, the inheritance of the genetic marker sequence is 
compared to the inheritance of the disease state. If, within a family carrying an 
5 autosomal dominant disorder such as high bone mass, every affected individual 

carries the same form of the marker and all the unaffected individuals carry at least 
one different form of the marker, there is a great probability that the disease gene 
and the marker are located close to each other. In this way, chromosomes may be 
systematically checked with known markers and compared to the disease state. The 

10 data obtained from the different families is combined, and analyzed together by a 
computer using statistical methods. The result is information indicating the 
probability of linkage between the genetic marker and the disease allowing different 
distances between them. A positive result can mean that the disease is very close to 
the marker, while a negative result indicates that it is far away on that chromosome, 

15 or on an entirely different chromosome. 

Linkage analysis is performed by typing all members of the affected family 
at a given marker locus and evaluating the co-inheritance of a particular disease state 
with the marker probe, thereby determining how often the two of them are co- 
inherited. The recombination frequency can be used as a measure of the genetic 

20 distance between two gene loci. A recombination frequency of 1% is equivalent to 1 
map unit, or 1 centiMorgan (cM), which is roughly equivalent to 1,000 kb of DNA. 
This relationship holds up to frequencies of about 20% or 20 cM. 

The entire human genome is 3,300 cM long. In order to find an unknown 
disease gene within 5-10 cM of a marker locus, the whole human genome can be 

25 searched with roughly 330 informative marker loci spaced at approximately 1 0 cM 
intervals (Botstein et al, Am. J. Hum. Genet, 32:314-331 (1980)). The reliability of 
linkage results is established by using a number of statistical methods. The method 
most commonly used for the analysis of linkage in humans is the LOD score method 
(Morton, Prog. Clin. Biol Res., 147:245-265 (1984), Morton et al, Am. J. Hum. 
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GeneL, 38:868-883 (1986)) which was incorporated into the computer program 
LIPED by Ott, Am. J. Hum. Genet, 28:528-529 (1976). LOD scores are the 
logarithm of the ratio of the likelihood that two loci are linked at a given distance to 
that they are not linked (>50 cM apart). The advantage of using logarithmic values 
is that they can be summed among families with the same disease. This becomes 
necessary given the relatively small size of human families. 

By convention, a total LOD score greater than 4- 3.0 (that is, odds of linkage 
at the specified recombination frequency being 1000 times greater than odds of no 
linkage) is considered to be significant evidence for linkage at that particular 
recombination frequency. A total LOD score of less than - 2.0 (that is, odds of no 
linkage being 100 times greater than odds of linkage at the specified frequency) is 
considered to be strong evidence that the two loci under consideration are not linked 
at that particular recombination frequency. Until recently, most linkage analyses 
have been performed on the basis of two-point data, which is the relationship 
between the disorder under consideration and a particular genetic marker. However, 
as a result of the rapid advances in mapping the human genome over the last few 
years, and concomitant improvements in computer methodology, it has become 
feasible to carry out linkage analyses using multi-point data. Multi-point analysis 
provide a simultaneous analysis of linkage between the disease and several linked 
genetic markers, when the recombination distance among the markers is known. 

Multi-point analysis is advantageous for two reasons. First, the 
informativeness of the pedigree is usually increased. Each pedigree has a certain 
amount of potential information, dependent on the number of parents heterozygous 
for the marker loci and the number of affected individuals in the family. However, 
few markers are sufficiently polymorphic as to be informative in all those 
individuals. If multiple markers are considered simultaneously, then the probability 
of an individual being heterozygous for at least one of the markers is greatly 
increased. Second, an indication of the position of the disease gene among the 
markers may be determined. This allows identification of flanking markers, and thus 
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eventually allows isolation of a small region in which the disease gene resides. 
Lathrop et al, Proc. Natl Acad. Sci. USA, 81:3443-3446 (1984) have written the 
most widely used computer package, LINKAGE, for multi-point analysis. 

There is a need in the art for identifying the gene associated with a high bone 
5 mass phenotype. The present invention is directed to this, as well as other, 
important ends. 

SUMMARY OF THE INVENTION 

The present invention describes the Zmaxl gene and the HBM gene on 
chromosome 1 lql3.3 by genetic linkage and mutation analysis. The use of 
additional genetic markers linked to the genes has aided this discovery. By using 
linkage analysis and mutation analysis, persons predisposed to HBM may be readily 
identified. Cloning methods using Bacterial Artificial Chromosomes have enabled 
the inventors to focus on the chromosome region of 1 lql3.3 and to accelerate the 
sequencing of the autosomal dominant gene. In addition, the invention identifies the 
Zmaxl gene and the HBM gene, and identifies the guanine-to-thymine 
polymorphism mutation at position 582 in the Zmaxl gene that produces the HBM 
gene and the HBM phenotype. 

The present invention identifies the Zmaxl gene and the HBM gene, which 
can be used to determine if people are predisposed to HBM and, therefore, not 
susceptible to diseases characterized by reduced bone density, including, for 
example, osteoporosis, or are predisposed and susceptible to diseases characterized 
by abnormally high bone density, such as, for example, osteoporosis. Older 
individuals carrying the HBM gene express the HBM protein, and, therefore, do not 
develop osteoporosis. In other words, the HBM gene is a suppressor of 
osteoporosis. This in vivo observation is a strong evidence that treatment of normal 
individuals with the HBM gene or protein, or fragments thereof, will ameliorate 
osteoporosis. 
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Moreover, such treatment will be indicated in the treatment of bone lesions, 
particularly bone fractures, for bone remodeling in the healing of such lesions. For 
example, persons predisposed to or suffering from stress fractures (i.e., the 
accumulation of stress-induced microfractures, eventually resulting in a true fracture 
5 through the bone cortex) may be identified and/or treated by means of the invention. 
Moreover, the methods and compositions of the invention will be of use in the 
treatment of secondary osteoporosis, where the course of therapy involves bone 
remodeling, such as endocrine conditions accompanying corticosteroid 
administration, hyperthyroidism, hypogonadism, hematologic malignancies, 
10 malabsorption and alcoholism, as well as disorders associated with vitamin D and/or 
phosphate metabolism, such as osteomalacia and rickets, and diseases characterized 
by abnormal or disordered bone remodeling, such as Paget f s disease, and in 
neoplasms of bone, which may be benign or malignant 

In various embodiments, the present invention is directed to nucleic acids, 
1 5 proteins, vectors, and transformed hosts of HBM and Zmaxl . 

Additionally, the present invention is directed to applications of the above 
embodiments of the invention including, for example, gene therapy, pharmaceutical 
development, and diagnostic assays for bone development disorders. In preferred 
embodiments, the present invention is directed to methods for treating, diagnosing, 
20 preventing and screening for osteoporosis. 

These and other aspects of the present invention are described in more detail 

below. 

BRIEF DESCRIPTION OF THE FIGURES 
Fig. 1 shows the pedigree of the individuals used in the genetic linkage 
studies. Under each individual is an ID number, the z-score for spinal BMD, and the 
allele calls for the critical markers on chromosome 1 1 . Solid symbols represent 
"affected" individuals. Symbols containing "N" are "unaffected" individuals. DNA 
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from 37 individuals was genotyped. Question marks denote unknown genotypes or 
individuals who were not genotyped. 

Fig. 2 depicts the BAC/STS content physical map of the HBM region in 
llql3.3. STS markers derived from genes, ESTs, microsatellites, random 
5 sequences, and BAC endsequences are denoted above the long horizontal line. For 
markers that are present in GDB the same nomenclature has been used. Locus 
names (Dl 1S####) are listed in parentheses after the primary name if available. 
STSs derived from BAC endsequences are listed with the BAC name first followed 
by L or R for the left and right end of the clone, respectively. The two large arrows 

10 indicate the genetic markers that define the HBM critical region. The horizontal 
lines below the STSs indicate BAC clones identified by PCR-based screening of a 
nine-fold coverage BAC library. Open circles indicate that the marker did not 
amplify the corresponding BAC library address during library screening. Clone 
names use the following convention: B for BAC, the plate, row and column address, 

1 5 followed by -H indicating the HBM project (i.e., B36F16-H). 

Figs. 3A-3F show the genomic structure of Zmaxl with flanking intron 
sequences. Translation is initiated by the underlined 11 ATG" in exon 1 . The site of 
the polymorphism in the HBM gene is in exon 3 and is represented by the 
underlined "G," whereby this nucleotide is a "T" in the HBM gene. The 3 1 

20 untranslated region of the mRNA is underlined within exon 23 (exon 1, SEQ ID 
NO:40; exon 2, SEQ ID NO:41; exon 3, SEQ ID NO:42; exon 4, SEQ ID NO:43; 
exon 5, SEQ ID NO:44; exon 6, SEQ ID NO:45; exon 7, SEQ ID NO:46; exon 8, 
SEQ ID NO:47; exon 9, SEQ ID NO:48; exon 10, SEQ ID NO:49; exon 11, SEQ ID 
NO:50; exon 12, SEQ ID NO:51; exon 13, SEQ ID NO:52; exon 14, SEQ ID 

25 NO:53; exon 15, SEQ ID NO:54; exon 16, SEQ ID NO:55; exon 17, SEQ ID 
NO:56; exon 18, SEQ ID NO:57; exon 19, SEQ ID NO:58; exon 20, SEQ ID 
NO:59; exon 21, SEQ ID NO:60; exon 22, SEQ ID NO:61; and exon 23; SEQ ID 
NO:62). 
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Fig. 4 shows the domain organization of Zmaxl, including the YWTD 
spacers, the extracellular attachment site, the binding site for LDL and calcium, the 
cysteine-rich growth factor repeats, the transmembrane region, the ideal PEST 
region with the CK-II phosphorylation site and the internalization domain. Fig. 4 
5 also shows the site of the glycine to valine change that occurs in the HBM protein. 
The signal peptide is located at amino acids 1-22, the extracellular domain is located 
at amino acids 23-1385, the transmembrane segment is located at amino acids 1386- 
1413, and the cytoplasmic domain is located at amino acids 1414-1615. 

Fig. 5 is a schematic illustration of the BAC contigs B527D12 and B200E21 
10 in relation to the HBM gene. 

Figs. 6A-6E are the nucleotide and amino acid sequences of the wild-type 
gene, Zmaxl . The location for the base pair substitution at nucleotide 582, a 
guanine to thymine, is underlined. This allelic variant is the HBM gene. The HBM 
gene encodes for a protein with an amino acid substitution of glycine to valine at 
15 position 171. The 5 f untranslated region (UTR) boundaries bases 1 to 70, and the 3 1 
UTR boundaries bases 4916-5120. 

Figs. 7 A and 7B are northern blot analyses showing the expression of Zmaxl 
in various tissues. 

Fig. 8 is a PCR product analysis. 
20 Fig. 9 is allele specific oligonucleotide detection of the Zmaxl exon 3 

mutation. 

Fig. 10 is the cellular localization of mouse Zmaxl by in situ hybridization at 
100X magnification using sense and antisense probes. 

Fig. 11 is the cellular localization of mouse Zmaxl by in situ hybridization at 
25 400X magnification using sense and antisense probes. 

Fig. 12 is the cellular localization of mouse Zmaxl by in situ hybridization 
of osteoblasts in the endosteum at 400X magnification using sense and antisense 
probes. 

Fig. 13 shows antisense inhibition of Zmaxl expression in MC-3T3 cells. 
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Fig. 14 shows aZmaxl Exon3 Allele Specific Oligonucleotide (ASO) assay 
which illustrates the rarity of the HBM1 allele (right panels; T-specific oligo; 58 °C 
Wash) as compared to the wild-type Zmaxl allele (left panels, G-specific oligo; 
55 °C Wash). The positive spots appearing in the right panels were positive 
5 controls. 

Fig. 15 depicts a model representing the potential role of Zmaxl in focal 
adhesion signaling. 

DETAILED DESCRIPTION OF THE INVENTION 

To aid in the understanding of the specification and claims, the following 

1 0 definitions are provided. 

"Gene" refers to a DNA sequence that encodes through its template or 
messenger RNA a sequence of amino acids characteristic of a specific peptide. The 
term "gene" includes intervening, non-coding regions, as well as regulatory regions, 
and can include 5' and 3' ends. 

1 5 "Gene sequence" refers to a DNA molecule, including both a DNA molecule 

which contains a non-transcribed or non-translated sequence. The term is also 
intended to include any combination of gene(s), gene fragment(s), non-transcribed 
sequence(s) or non-translated sequence(s) which are present on the same DNA 
molecule. 

20 The sequences of the present invention may be derived from a variety of 

sources including DNA, cDNA, synthetic DNA, synthetic RNA or combinations 
thereof. Such sequences may comprise genomic DNA which may or may not 
include naturally occurring introns. Moreover, such genomic DNA may be obtained 
in association with promoter regions or poly (A) sequences. The sequences, 

25 genomic DNA or cDNA may be obtained in any of several ways. Genomic DNA 
can be extracted and purified from suitable cells by means well known in the art. 
Alternatively, mRNA can be isolated from a cell and used to produce cDNA by 
reverse transcription or other means. 
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"cDNA" refers to complementary or copy DNA produced from an RNA 
template by the action of RNA-dependent DNA polymerase (reverse transcriptase). 
Thus, a "cDNA clone" means a duplex DNA sequence complementary to an RNA 
molecule of interest, carried in a cloning vector or PCR amplified. This term 
5 includes genes from which the intervening sequences have been removed. 

"Recombinant DNA" means a molecule that has been recombined by in vitro 
splicing cDNA or a genomic DNA sequence. 

"Cloning" refers to the use of in vitro recombination techniques to insert a 
particular gene or other DNA sequence into a vector molecule. In order to 
10 successfully clone a desired gene, it is necessary to use methods for generating DNA 
fragments, for joining the fragments to vector molecules, for introducing the 
composite DNA molecule into a host cell in which it can replicate, and for selecting 
the clone having the target gene from amongst the recipient host cells. 

"cDNA library" refers to a collection of recombinant DNA molecules 
15 containing cDNA inserts which together comprise the entire genome of an organism. 
Such a cDNA library can be prepared by methods known to one skilled in the art 
and described by, for example, Cowell and Austin, "cDNA Library Protocols," 
Methods in Molecular Biology (1997). Generally, RNA is first isolated from the 
cells of an organism from whose genome it is desired to clone a particular gene. 
20 "Cloning vehicle" refers to a plasmid or phage DNA or other DNA sequence 

which is able to replicate in a host cell. The cloning vehicle is characterized by one 
or more endonuclease recognition sites at which such DNA sequences may be cut in 
a determinable fashion without loss of an essential biological function of the DNA, 
which may contain a marker suitable for use in the identification of transformed 
25 cells. 

"Expression control sequence" refers to a sequence of nucleotides that 
control or regulate expression of structural genes when operably linked to those 
genes. These include, for example, the lac systems, the trp system, major operator 
and promoter regions of the phage lambda, the control region of fd coat protein and 
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other sequences known to control the expression of genes in prokaryotic or 
eukaryotic cells. Expression control sequences will vary depending on whether the 
vector is designed to express the operably linked gene in a prokaryotic or eukaryotic 
host, and may contain transcriptional elements such as enhancer elements, 
5 termination sequences, tissue-specificity elements and/or translational initiation and 
termination sites. 

"Expression vehicle" refers to a vehicle or vector similar to a cloning vehicle 
but which is capable of expressing a gene which has been cloned into it, after 
transformation into a host. The cloned gene is usually placed under the control of 
10 (i.e., operably linked to) an expression control sequence. 

"Operator" refers to a DNA sequence capable of interacting with the specific 
repressor, thereby controlling the transcription of adjacent gene(s). 

"Promoter" refers to a DNA sequence that can be recognized by an RNA 
polymerase. The presence of such a sequence permits the RNA polymerase to bind 
15 and initiate transcription of operably linked gene sequences. 

"Promoter region" is intended to include the promoter as well as other gene 
sequences which may be necessary for the initiation of transcription. The presence 
of a promoter region is sufficient to cause the expression of an operably linked gene 
sequence. 

"Operably linked" means that the promoter controls the initiation of 
expression of the gene. A promoter is operably linked to a sequence of proximal 
DNA if upon introduction into a host cell the promoter determines the transcription 
of the proximal DNA sequence(s) into one or more species of RNA. A promoter is 
operably linked to a DNA sequence if the promoter is capable of initiating 
transcription of that DNA sequence. 

"Prokaryote" refers to all organisms without a true nucleus, including 
bacteria. 

"Eukaryote" refers to organisms and cells that have a true nucleus, including 
mammalian cells. 
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"Host" includes prokaryotes and eukaryotes, such as yeast and filamentous 
fungi, as well as plant and animal cells. The term includes an organism or cell that 
is the recipient of a replicable expression vehicle. 

"Fragment" of a gene refers to any variant of the gene that possesses the 
5 biological activity of that gene. 

"Variant" refers to a gene that is substantially similar in structure and 
biological activity or immunological characteristics to either the entire gene or to a 
fragment of the gene. Provided that the two genes possess a similar activity, they 
are considered variant as that term is used herein even if the sequence of amino acid 
10 residues is not identical. 

"Amplification of nucleic acids" refers to methods such as polymerase chain 
reaction (PCR), ligation amplification (or ligase chain reaction, LCR) and 
amplification methods based on the use of Q-beta replicase. These methods are well 
known in the art and described, for example, in U.S. Patent Nos. 4,683,195 and 
15 4,683,202. Reagents and hardware for conducting PCR are commercially available. 
Primers useful for amplifying sequences from the HBM region are preferably 
complementary to, and hybridize specifically to sequences in the HBM region or in 
regions that flank a target region therein. HBM sequences generated by 
amplification may be sequenced directly. Alternatively, the amplified sequence(s) 
20 may be cloned prior to sequence analysis. 

"Antibodies" may refer to polyclonal and/or monoclonal antibodies and 
fragments thereof, and immunologic binding equivalents thereof, that can bind to the 
HBM proteins and fragments thereof or to nucleic acid sequences from the HBM 
region, particularly from the HBM locus or a portion thereof. The term antibody is 
25 used both to refer to a homogeneous molecular entity, or a mixture such as a serum 
product made up of a plurality of different molecular entities. Proteins may be 
prepared synthetically in a protein synthesizer and coupled to a carrier molecule and 
injected over several months into rabbits. Rabbit sera is tested for immunoreactivity 
to the HBM protein or fragment. Monoclonal antibodies may be made by injecting 



BNSDOCID: <WO 0177327A1J_> 



WO 01/77327 



PCT/US00/16951 



-17- 

mice with the proteins, or fragments thereof. Monoclonal antibodies will be 
screened by ELISA and tested for specific immunoreactivity with HBM protein or 
fragments thereof. Harlow et al, Antibodies: A Laboratory Manual, Cold Spring 
Harbor Laboratory, Cold Spring Harbor, NY (1988). These antibodies will be useful 
5 in assays as well as pharmaceuticals. 

"HBM 11 refers to high bone mass. 

"HBM protein" refers to a protein that is identical to a Zmaxl protein except 
that it contains an alteration of glycine 171 to valine. An HBM protein is defined 
for any organism that encodes a Zmaxl true homologue. For example, a mouse 
10 HBM protein refers to the mouse Zmaxl protein having the glycine 170 to valine 
substitution. 

"HBM gene" refers to the genomic DNA sequence found in individuals 
showing the HBM characteristic or phenotype, where the sequence encodes the 
protein indicated by SEQ ID NO: 4. The HBM gene and the Zmaxl gene are allelic. 

15 The protein encoded by the HBM gene has the property of causing elevated bone 
mass, while the protein encoded by the Zmaxl gene does not. The HBM gene and 
the Zmaxl gene differ in that the HBM gene has a thymine at position 582, while 
the Zmaxl gene has a guanine at position 582. The HBM gene comprises the 
nucleic acid sequence shown as SEQ ID NO: 2. The HBM gene may also be 

20 referred to as an "HBM polymorphism." 

"Normal," "wild-type," "unaffected" and "Zmaxl" all refer to the genomic 
DNA sequence that encodes the protein indicated by SEQ ID NO: 3. The Zmaxl 
gene has a guanine at position 582. The Zmaxl gene comprises the nucleic acid 
sequence shown as SEQ ED NO: L "Normal," "wild-type," "unaffected" and 

25 "Zmaxl" also refer to allelic variants of the genomic sequence that encodes proteins 
that do not contribute to elevated bone mass. The Zmaxl gene is common in the 
human population, while the HBM gene is rare. 

"5YWT+EGF" refers to a repeat unit found in the Zmaxl protein, consisting 
of five YWT repeats followed by an EGF repeat. 
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"Bone development" generally refers to any process involved in the change 
of bone over time, including, for example, normal development, changes that occur 
during disease states, and changes that occur during aging. "Bone development 
disorder" particularly refers to any disorders in bone development including, for 
5 example, changes that occur during disease states and changes that occur during 

aging. Bone development may be progressive or cyclical in nature. Aspects of bone 
that may change during development include, for example, mineralization, 
formation of specific anatomical features, and relative or absolute numbers of 
various cell types. 

10 "Bone modulation" or "modulation of bone formation" refers to the ability to 

affect any of the physiological processes involved in bone remodeling, as will be 
appreciated by one skilled in the art, including, for example, bone resorption and 
appositional bone growth, by, inter alia, osteoclastic and osteoblastic activity, and 
may comprise some or all of bone formation and development as used herein. 

15 "Normal bone density" refers to a bone density within two standard 

deviations of a Z score of 0. 

A "Zmaxl system" refers to a purified protein, cell extract, cell, animal, 
human or any other composition of matter in which Zmaxl is present in a normal or 
mutant form. 

20 A "surrogate marker" refers to a diagnostic indication, symptom, sign or 

other feature that can be observed in a cell, tissue, human or animal that is correlated 
with the HBM gene or elevated bone mass or both, but that is easier to measure than 
bone density. The general concept of a surrogate marker is well accepted in 
diagnostic medicine. 

25 The present invention encompasses the Zmaxl gene and Zmaxl protein in 

the forms indicated by SEQ ID NOS: 1 and 3, respectively, and other closely related 
variants, as well as the adjacent chromosomal regions of Zmaxl necessary for its 
accurate expression. In a preferred embodiment, the present invention is directed to 
at least 15 contiguous nucleotides of the nucleic acid sequence of SEQ ID NO: 1. 
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The present invention also encompasses the HBM gene and HBM protein in 
the forms indicated by SEQ ID NO: 2 and 4, respectively, and other closely related 
variants, as well as the adjacent chromosomal regions of the HBM gene necessary 
for its accurate expression. In a preferred embodiment, the present invention is 
5 directed to at least 15 contiguous nucleotides of the nucleic acid sequence of SEQ 
ID NO: 2. More preferably, the present invention is directed to at least 15 
contiguous nucleotides of the nucleic acid sequence of SEQ ID NO: 2, wherein one 
of the 15 contiguous nucleotides is the thymine at nucleotide 582. 

The invention also relates to the nucleotide sequence of the Zmaxl gene 
10 region, as well as the nucleotide sequence of the HBM gene region. More 

particularly, a preferred embodiment are the B AC clones containing segments of the 
Zmaxl gene region B200E21-H and B527D12-H. A preferred embodiment is the 
: nucleotide sequence of the BAC clones consisting of SEQ ID NOS: 5-12. 

The invention also concerns the use of the nucleotide sequence to identify 
15 . DNA probes for the Zmaxl gene and the HBM gene, PCR primers to amplify the 
Zmaxl gene and the HBM gene, nucleotide polymorphisms in the Zmaxl gene and 
the HBM gene, and regulatory elements of the Zmaxl gene and the HBM gene. 

This invention describes the further localization of the chromosomal location 
of the Zmaxl gene and HBM gene on chromosome 1 lql3.3 between genetic 
20 markers Dl 1S987 and SNP_CONTIG033-6, as well as the DNA sequences of the 
Zmaxl gene and the HBM gene. The chromosomal location was refined by the 
addition of more genetic markers to the mapping panel used to map the gene, and by 
the extension of the pedigree to include more individuals. The pedigree extension 
was critical because the new individuals that have been genotyped harbor critical 
25 recombination events that narrow the region. To identify genes in the region on 
1 lql3.3, a set of BAC clones containing this chromosomal region was identified. 
The BAC clones served as a template for genomic DNA sequencing, and also as a 
reagent for identifying coding sequences by direct cDNA selection. Genomic 
sequencing and direct cDNA selection were used to characterize more than 1 .5 
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million base pairs of DNA from 1 lql3.3. The Zmaxl gene was identified within 
this region and the HBM gene was then discovered after mutational analysis of 
affected and unaffected individuals. 

When a gene has been genetically localized to a specific chromosomal 
5 region, the genes in this region can be characterized at the molecular level by a 
series of steps that include: cloning of the entire region of DNA in a set of 
overlapping clones (physical mapping), characterization of genes encoded by these 
clones by a combination of direct cDNA selection, exon trapping and DNA 
sequencing (gene identification), and identification of mutations in these genes by 
1 0 comparative DNA sequencing of affected and unaffected members of the HBM 
kindred (mutation analysis). 

Physical mapping is accomplished by screening libraries of human DNA 
cloned in vectors that are propagated in E. coli or S. cereviseae using PCR assays 
designed to amplify unique molecular landmarks in the chromosomal region of 
15 interest. To generate a physical map of the HBM candidate region, a library of 

human DNA cloned in Bacterial Artificial Chromosomes (BACs) was screened with 
a set of Sequence Tagged Site (STS) markers that had been previously mapped to 
chromosome 1 Iql2-ql3 by the efforts of the Human Genome Project. 

STSs are unique molecular landmarks in the human genome that can be 
20 assayed by PCR. Through the combined efforts of the Human Genome Project, the 
location of thousands of STSs on the twenty-two autosomes and two sex 
chromosomes has been determined. For a positional cloning effort, the physical 
map is tied to the genetic map because the markers used for genetic mapping can 
also be used as STSs for physical mapping. By screening a BAC library with a 
25 combination of STSs derived from genetic markers, genes, and random DNA 

fragments, a physical map comprised of overlapping clones representing all of the 
DNA in a chromosomal region of interest can be assembled. 

BACs are cloning vectors for large (80 kilobase to 200 kilobase) segments of 
human or other DNA that are propagated in E. coli. To construct a physical map 
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using BACs, a library of BAC clones is screened so that individual clones harboring 
the DNA sequence corresponding to a given STS or set of STSs are identified. 
Throughout most of the human genome, the STS markers are spaced approximately 
20 to 50 kilobases apart, so that an individual BAC clone typically contains at least 
5 two STS markers. In addition, the BAC libraries that were screened contain enough 
cloned DNA to cover the human genome six times over. Therefore, an individual 
STS typically identifies more than one BAC clone. By screening a six-fold 
coverage BAC library with a series of STS markers spaced approximately 50 
kilobases apart, a physical map consisting of a series of overlapping BAC clones, i.e. 

10 BAC contigs, can be assembled for any region of the human genome. This map is 

closely tied to the genetic map because many of the STS markers used to prepare the 
physical map are also genetic markers. 

When constructing a physical map, it often happens that there are gaps in the 
STS map of the genome that result in the inability to identify BAC clones that are 

15 overlapping in a given location. Typically, the physical map is first constructed 

from a set of STSs that have been identified through the publicly available literature 
and World Wide Web resources. The initial map consists of several separate BAC 
contigs that are separated by gaps of unknown molecular distance. To identify BAC 
clones that fill these gaps, it is necessary to develop new STS markers from the ends 

20 of the clones on either side of the gap. This is done by sequencing the terminal 200 
to 300 base pairs of the BACs flanking the gap, and developing a PCR assay to 
amplify a sequence of 100 or more base pairs. If the terminal sequences are 
demonstrated to be unique within the human genome, then the new STS can be used 
to screen the BAC library to identify additional BACs that contain the DNA from 

25 the gap in the physical map. To assemble a BAC contig that covers a region the size 
of the HBM candidate region (2,000,000 or more base pairs), it is often necessary to 
develop new STS markers from the ends of several clones. 

After building a BAC contig, this set of overlapping clones serves as a 
template for identifying the genes encoded in the chromosomal region. Gene 
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identification can be accomplished by many methods. Three methods are commonly 
used: (1) a set of BACs selected from the BAC contig to represent the entire 
chromosomal region can be sequenced, and computational methods can be used to 
identify all of the genes, (2) the BACs from the BAC contig can be used as a reagent 
5 to clone cDNAs corresponding to the genes encoded in the region by a method 

termed direct cDNA selection, or (3) the BACs from the BAC contig can be used to 
identify coding sequences by selecting for specific DNA sequence motifs in a 
procedure called exon trapping. The present invention includes genes identified by 
the first two methods. 

10 To sequence the entire BAC contig representing the HBM candidate region, 

a set of BACs was chosen for subcloning into plasmid vectors and subsequent DNA 
sequencing of these subclones. Since the DNA cloned in the BACs represents 
genomic DNA, this sequencing is referred to as genomic sequencing to distinguish it 
from cDNA sequencing. To initiate the genomic sequencing for a chromosomal 

1 5 region of interest, several non-overlapping BAC clones are chosen. DNA for each 
BAC clone is prepared, and the clones are sheared into random small fragments 
which are subsequently cloned into standard plasmid vectors such as pUC18. The 
plasmid clones are then grown to propagate the smaller fragments, and these are the 
templates for sequencing. To ensure adequate coverage and sequence quality for the 

20 BAC DNA sequence, sufficient plasmid clones are sequenced to yield six-fold 

coverage of the BAC clone. For example, if the BAC is 100 kilobases long, then 
phagemids are sequenced to yield 600 kilobases of sequence. Since the BAC DNA 
was randomly sheared prior to cloning in the phagemid vector, the 600 kilobases of 
raw DNA sequence can be assembled by computational methods into overlapping 

25 DNA sequences termed sequence contigs. For the purposes of initial gene 
identification by computational methods, six-fold coverage of each BAC is 
sufficient to yield ten to twenty sequence contigs of 1000 base pairs to 20,000 base 
pairs. 
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The sequencing strategy employed in this invention was to initially sequence 
"seed" B ACs from the B AC contig in the HBM candidate region. The sequence of 
the "seed" BACs was then used to identify minimally overlapping B ACs from the 
contig, and these were subsequently sequenced. In this manner, the entire candidate 
5 region was sequenced, with several small sequence gaps left in each BAC. This 

sequence served as the template for computational gene identification. One method 
for computational gene identification is to compare the sequence of BAC contig to 
publicly available databases of cDNA and genomic sequences, e.g. unigene, dbEST, 
genbank. These comparisons are typically done using the BLAST family of 

10 computer algorithms and programs (Altschul et al, J. Mol. Biol., 215:403-410 

(1990)). The BAC sequence can also be translated into protein sequence, and the 
protein sequence can be used to search publicly available protein databases, using a 
version of BLAST designed to analyze protein sequences (Altschul et al, Nucl. Acids 
Res., 25:3389-3402 (1997)). Another method is to use computer algorithms such as 

15 MZEF (Zhang, Proc. Natl Acad. Set, 94:565-568 (1997)) and GRAIL (Uberbacher 
et al, Methods EnzymoL, 266:259-281 (1996)), which predict the location of exons 
in the sequence based on the presence of specific DNA sequence motifs that are 
common to all exons, as well as the presence of codon usage typical of human 
protein encoding sequences. 

20 In addition to identifying genes by computational methods, genes were also 

identified by direct cDNA selection (Del Mastro et al, Genome Res. 5(2): 185-194 
(1995)). In direct cDNA selection, cDNA pools from tissues of interest are 
prepared, and the BACs from the candidate region are used in a liquid hybridization 
assay to capture the cDNAs which base pair to coding regions in the BAC. In the 

25 methods described herein, the cDNA pools were created from several different 

tissues by random priming the first strand cDNA from polyA RNA, synthesizing the 
second strand cDNA by standard methods, and adding linkers to the ends of the 
cDNA fragments. The linkers are used to amplify the cDNA pools. The BAC 
clones are used as a template for in vitro DNA synthesis to create a biotin labelled 
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copy of the BAC DNA. The biotin labelled copy of the BAC DNA is then 
denatured and incubated with an excess of the PCR amplified, linkered cDNA pools 
which have also been denatured. The BAC DNA and cDNA are allowed to anneal 
in solution, and heteroduplexes between the BAC and the cDNA are isolated using 
streptavidin coated magnetic beads. The cDNAs that are captured by the BAC are 
then amplified using primers complimentary to the linker sequences, and the 
hybridization/selection process is repeated for a second round. After two rounds of 
direct cDNA selection, the cDNA fragments are cloned, and a library of these direct 
selected fragments is created. 

The cDNA clones isolated by direct selection are analyzed by two methods. 
Since a pool of BACs from the HBM candidate region is used to provide the 
genomic DNA sequence, the cDNAs must be mapped to individual BACs. This is 
accomplished by arraying the BACs in microtiter dishes, and replicating their DNA 
in high density grids. Individual cDNA clones are then hybridized to the grid to 
confirm that they have sequence identity to an individual BAC from the set used for 
direct selection, and to determine the specific identity of that BAC. cDNA clones 
that are confirmed to correspond to individual BACs are sequenced. To determine 
whether the cDNA clones isolated by direct selection share sequence identity or 
similarity to previously identified genes, the DNA and protein coding sequences are 
compared to publicly available databases using the BLAST family of programs. 

The combination of genomic DNA sequence and cDNA sequence provided 
by BAC sequencing and by direct cDNA selection yields an initial list of putative 
genes in the region. The genes in the region were all candidates for the HBM locus. 
To further characterize each gene, Northern blots were performed to determine the 
size of the transcript corresponding to each gene, and to determine which putative 
exons were transcribed together to make an individual gene. For Northern blot 
analysis of each gene, probes were prepared from direct selected cDNA clones or by 
PCR amplifying specific fragments from genomic DNA or from the BAC encoding 
the putative gene of interest. The Northern blots gave information on the size of the 
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transcript and the tissues in which it was expressed. For transcripts which were not 
highly expressed, it was sometimes necessary to perform a reverse transcription 
PCR assay using RNA from the tissues of interest as a template for the reaction. 

Gene identification by computational methods and by direct cDNA selection 
provides unique information about the genes in a region of a chromosome. When 
genes are identified, then it is possible to examine different individuals for mutations 
in each gene. 

I. Phenotyping using DXA Measurements 

Spinal bone mineral content (BMC) and bone mineral density (BMD) 
measurements performed at Creighton University (Omaha, Nebraska) were made by 
DXA using a Norland Instruments densitometer (Norland XR2600 Densitometer, 
Dual Energy X-ray Absorptiometry, DXA). Spinal BMC and BMD at other 
locations used the machinery available. There are estimated to be 800 DXA 
machines currently operating in the U.S. Most larger cities have offices or imaging 
centers which have DXA capabilities, usually a Lunar or Hologic machine. Each 
location that provided spine BMC and BMD data included copies of the printouts, 
from their machines to provide verification that the regions of interest for 
measurement of BMD have been chosen appropriately. Complete clinical histories 
and skeletal radiographs were obtained. 

The HBM phenotype is defined by the following criteria: very high spinal 
BMD; a clinical history devoid of any known high bone mass syndrome; and 
skeletal radiographs showing a normal shape of the appendicular skeleton. 

II. Genotyping of Microsatellite Markers 

To narrow the genetic interval to a region smaller than that originally 
reported by Johnson et al, Am. J. Hum. Genet, 60:1326-1332 (1997), additional 
microsatellite markers on chromosome 1 lql2-13 were typed. The new markers 
included: D11S4191, D11S1883, D11S1785, D11S4113, D11S4136, D11S4139, 
(Dib, et al, Nature, 380:152-154 (1996), FGF3 (Polymeropolous, et al, NucL Acid 
Res., 18:7468 (1990)), as well as GTC_HBM__Marker_l, GTC„HBM_Marker_2, 
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GTC_HBM_Marker_3 , GTC_HBM_Marker_4, GTCJHBM__Marker_5, 
GTC_HBM__Marker_6, and GTC_HBM_Marker__7 {See Fig. 2). 

Blood (20 ml) was drawn into lavender cap (EDTA containing) tubes by a 
certified phlebotomist. The blood was stored refrigerated until DNA extraction. 
DNA has been extracted from blood stored for up to 7 days in the refrigerator 
without reduction in the quality or quantity of yield. For those subjects that have 
blood drawn at distant sites, a shipping protocol was successfully used on more than 
a dozen occasions. Blood samples were shipped by overnight express in a 
styrofoam container with freezer packs to provide cooling. Lavender cap tubes were 
placed on individual plastic shipping tubes and then into "zip-lock" biohazard bags. 
When the samples arrived the next day, they were immediately processed to extract 
DNA. 

The DNA extraction procedure used a kit purchased from Gentra Systems, 
Inc. (Minneapolis, Minnesota). Briefly, the procedure involved adding 3 volumes of 
a red blood cell lysis buffer to the whole blood. After incubations for 10 minutes at 
room temperature, the solution was centrifuged in a Beckman tabletop centrifuge at 
2,000 X g for 10 minutes. The white blood cell pellet was resuspended in Cell Lysis 
Buffer. Once the pellet was completely resuspended and free of cell clumps, the 
solution was digested with RNase A for 15 minutes at 37°C. Proteins were 
precipitated by addition of the provided Protein Precipitation Solution and removed 
by centrifugation. The DNA was precipitated out of the supernatant by addition of 
isopropanol. This method was simple and fast, requiring only 1-2 hours, and 
allowed for the processing of dozens of samples simultaneously. The yield of DNA 
was routinely >8 mg for a 20 ml sample of whole blood and had a MW of >50 kb. 
DNA was archived by storing coded 50 \xg aliquots at -80°C as an ethanol 
precipitate. 

DNA was genotyped using one fluorescently labeled oligonucleotide primer 
and one unlabeled oligonucleotide primer. Labeled and unlabeled oligonucleotides 
were obtained from Integrated DNA Technologies, Inc. (Coralville, Iowa). All other 
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reagents for microsatellite genotyping were purchased from Perkin Elmer- Applied 
Biosy stems, Inc. ("PE-ABI") (Norwalk, Connecticut). Individual PCR reactions 
were performed for each marker, as described by PE-ABI using AmpliTag DNA 
Polymerase. The reactions were added to 3.5 jil of loading buffer containing 
5 deionized formamide, blue dextran and TAMRA 350 size standards (PE-ABI). 

After heating at 95 °C for 5 minutes to denature the DNA, the samples were loaded 
and electrophoresed as described in the operators manual for the Model 377 DNA 
Sequencer (PE-ABI, Foster City, California). After gel electrophoresis, the data was 
analyzed using PE-ABI GENESCAN™ and GENOTYPER™ software. First, 

10 within the GENESCAN™ software, the lane tracking was manually optimized prior 
to the first step of analysis. After the gel lane data was extracted, the standard curve 
profiles of each lane were examined and verified for linearity and size calling. 
Lanes, which had problems with either of these parameters, were re-tracked and 
verified. Once all lanes were tracked and the size standards were correctly identified, 

15 the data were imported into GENOTYPER™ for allele identification To expedite 

allele calling (binning), the program Linkage Designer from the Internet web-site of , . 
Dr. Guy Van Camp (^ttp://alt.www.uia.ac.be/u/dnalab/ld.html) was used. This 
program greatly facilitates the importing of data generated by GENOTYPER™ into 
the pedigree drawing program Cyrillic (Version 2.0, Cherwell Scientific Publishing 

20 Limited, Oxford, Great Britain) and subsequent linkage analysis using the program 
LINKAGE (Lathrop et al, Am. J. Hum. Genet, 37:482-498 (1985)). 
m. Linkage Analysis 

Fig. 1 demonstrates the pedigree of the individuals used in the genetic 
linkage studies for this invention. Specifically, two-point linkage analysis was 

25 performed using the MLINK and LINKMAP components of the program 
LINKAGE (Lathrop et al, Am. J. Hum. Genet. , 37:482-498 (1985)). 
Pedigree/marker data was exported from Cyrillic as a pre-file into the Makeped 
program and converted into a suitable ped-file for linkage analysis. 
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The original linkage analysis was performed using three models: (i) an 
autosomal dominant, fully penetrant model, (ii) an autosomal dominant model with 
reduced penetrance, and (iii) a quantitative trait model. The HBM locus was 
mapped to chromosome 1 lql2-13 by analyzing DNA for linked markers from 22 
members of a large, extended kindred. A highly automated technology was used 
with a panel of 345 fluorescent markers which spanned the 22 autosomes at a 
spacing interval ranging from 6-22 cM. Only markers from this region of 
chromosome 11 showed evidence of linkage (LOD score -3.0). The highest LOD 
score (5.74) obtained by two-point and multipoint analysis was Dl 1S987 (map 
position 55 in Fig, 2). The 95% confidence interval placed the HBM locus between 
markers Dl 1S905 and Dl 1S937 (map position 41-71 in Fig, 2). Haplotype analysis 
also places the Zmaxl gene in this same region. Further descriptions of the markers 
Dl 1S987, Dl 1S905, and Dl 1S937 can be found in Gyapay et al, Nature Genetics, 
Vol. 7, (1994). 

In this invention, the inventors report the nanrowing of the HBM interval to 
the region between markers Dl 1S987 and GTC_HBMJVIarker_5. These two 
markers lie between the delimiting markers from the original analysis (Dl 1S1 1S905 
and Dl 1S937) and are approximately 3 cM from one another. The narrowing of the 
interval was accomplished using genotypic data from the markers Dl 1S4191, 
Dl 1S1 883, Dl 1S1785, Dl 1S41 13, Dl 1S4136, Dl 1S4139, (Dib et al, Nature, 
380:152-154 (1996)), FGF3 (Polymeropolous et *\,NucL Acid Res., 18:7468 (1990)) 
(information about the genetic markers can be found at the internet site of the 
Genome Database, http://gdbwww.gdb.org/), as well as the markers 
GTC JHBM__Marker_l , GTC_HBM„Marker_2, GTC_HBM_Marker_3, 
GTC_HBMJVfarker_4, GTC_HBM_Marker_5, GTC_HBM_Marker_6, and 
GTC_HBM__Marker_7. 

As shown in Fig. 1, haplotype analysis with the above genetic markers 
identifies recombination events (crossovers) in individuals 9019 and 9020 that 
significantly refine the interval of chromosome 1 1 to which the Zmaxl gene is 
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localized. Individual 9019 is an HBM-affected individual that inherits a portion of 
chromosome 1 1 from the maternal chromosome with the HBM gene, and a portion 
from the chromosome 1 1 homologue. The portion inherited from the HBM gene- 
carrying chromosome includes markers D11S935, D11S1313, 
5 GTC_HBM_Marker_4 5 Dl 1S987, Dl 1S1296, GTC_HBM_Marker_6, 
GTC_HBM_Marker_2, Dl 1S970, GTC_HBM_Marker_3 3 Dl 1S41 13, 
GTC_HBM_Marker_l , GTCJHBM_Marker_7 and GTC_HBM_Marker_5 . The 
portion from Dl 1S4136 and continuing in the telomeric direction is derived from the 
non-HBM chromosome. This data places the Zmaxl gene in a location centromeric 
10 to the marker GTCJHBM_Marker_5. Individual 9020 is an unaffected individual 
who also exhibits a critical recombination event. This individual inherits a 
recombinant paternal chromosome 11 that includes markers Dl 1S935, D11S1313, 
GTC_HBM__Marker_4, Dl 1S987, Dl IS 1296 and GTC_HBM_Marker_6 from her 
father's (individual 0115) chromosome 11 homologue that carries the HBM gene, 
15 and markers GTCjHBM_Marker_2, Dl 1S970, GTC_HBM_Marker_3 , 

GTC JHBM__Marker_l , GTC Jffi3M_Marker_7 , GTC_HBM_Marker_5 , D11S4136, 
Dl 1S4139, Dl 1S1314, and Dl 1S937 from her father's chromosome 1 1 that does not 
carry the HBM gene. Marker Dl 1S41 13 is uninformative due to its homozygous 
nature in individual 0115. This recombination event places the centromeric 
20 boundary of the HBM region between markers Dl 1S1296 and Dl 1S987. 

Two-point linkage analysis was also used to confirm the location of the 
Zmaxl gene on chromosome 11. The linkage results for two point linkage analysis 
under a model of full penetrance are presented in Table 1 below. This table lists the 
genetic markers in the first column and the recombination fractions across the top of 
the table. Each cell of the column shows the LOD score for an individual marker 
tested for linkage to the Zmaxl gene at the recombination fraction shown in the first 
row. For example, the peak LOD score of 7.66 occurs at marker Dl 1S970, which is 
within the interval defined by haplotype analysis. 
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TABLE1 



Marker 


0.0 


0.05 


0.1 


0.15 


0.2 


0.25 


0.3 


0.35 


0.4 


D11S935 


- infinity 


0.39 


0.49 


0.47 


0.41 


0.33 


0.25 


0.17 


0.10 


D11S1313 


- infinity 


2.64 


2.86 


2.80 


2.59 


2.30 


1.93 


1.49 


1.00 


D11S987 


- infinity 


5.49 


5.18 


4.70 


4.13 


3.49 


2.79 


2.03 


1.26 


D11S4113 


4.35 


3.99 


3.62 


3.24 


2.83 


2.40 


1.94 


1.46 


0.97 


D11S1337 


2.29 


2.06 


1.81 


1.55 


1.27 


0.99 


0.70 


0.42 


0.18 


D11S970 


7.66 


6.99 


6.29 


5.56 


4.79 


3.99 


3.15 


2.30 


1.44 


D11S4136 


6.34 


5.79 


5.22 


4.61 


3.98 


3.30 


2.59 


1.85 


1.11 , 


D11S4139 


6.80 


6.28 


5.73 


5.13 


4.50 


3.84 


3.13 


2.38 


1.59 


FGF3 


0.59 


3.23 


3.15 


2.91 


2.61 


2.25 


1.84 


1.40 


0.92 


D11S1314 


6.96 


6.49 


5.94 


5.34 


4.69 


4.01 


3.27 


2.49 


1.67 


D11S937 


-infinity 


4.98 


4.86 


4.52 


4.06 


3.51 


2.88 


2.20 


1.47 



A single nucleotide polymorphism (SNP) further defines the HBM region. 

1 5 This SNP is termed SNP_Contig033-6 and is located 25 kb centromeric to the 

genetic marker GTC_HBM_Marker_5. This SNP is telomeric to the genetic marker 
GTC_HBM_Marker_7. SNP_Contig033-6 is present in HBM-affected individual 
01 13. However, the HBM-affected individual 9019, who is the son of 01 13, does 
not carry this SNP. Therefore, this indicates that the crossover is centromeric to this 

20 SNP. The primer sequence for the genetic markers GTC_HBM_Marker_5 and 
GTC_HBM_Marker_7 is shown in Table 2 below. 



TABLE 2 



Marker 


Primer (Forward) 


Primer (Reverse) 


GTC_HBM_ 
Marker_5 


TTTTGGGTACACAATTCAGTCG 


AAAACTGTGGGTGCTTCTGG 


GTCJHBM_ 
Marker 7 


GTGATTGAGCCAATCCTGAGA 


TGAGCCAAATAAACCCCTTCT 
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The kindred described have several features of great interest, the most 
important being that their bones, while very dense, have an absolutely normal shape. 
The outer dimensions of the skeletons of the HBM-affected individuals are normal, 
and, while medullary cavities are present, there is no interference with 
5 hematopoiesis. The HBM-affected members seem to be resistant to fracture, and 
there are no neurologic symptoms, and no symptoms of impairment of any organ or 
system function in the members examined. HBM-affected members of the kindred 
live to advanced age without undue illness or disability. Furthermore, the HBM 
phenotype matches no other bone disorders such as osteoporosis, osteoporosis 

10 pseudoglioma, Engelmann's disease, Ribbing's disease, hyperphosphatasemia, Van 
Buchem's disease, melorheostosis, osteopetrosis, pycnodysostosis, sclerostenosis, 
osteopoikilosis, acromegaly, Paget's disease, fibrous dysplasia, tubular stenosis, 
osteogenesis imperfecta, hypoparathyroidism, pseudohypoparathyroidism, 
pseudopseudohypoparathyroidism, primary and secondary hyperparathyroidism and 

15 associated syndromes, hypercalciuria, medullary carcinoma of the thyroid gland, 
osteomalacia and other diseases. Clearly, the HBM locus in this family has a very 
powerful and substantial role in regulating bone density, and its identification is an 
important step in understanding the pathway(s) that regulate bone density and the 
pathogenesis of diseases such as osteoporosis. 

20 In addition, older individuals carrying the HBM gene, and therefore 

expression of the HBM protein, do not show loss of bone mass characteristic of 
normal individuals. In other words, the HBM gene is a suppressor of osteoporosis. 
In essence, individuals carrying the HBM gene are dosed with the HBM protein, 
and, as a result, do not develop osteoporosis. This in vivo observation is strong 

25 evidence that treatment of normal individuals with the HBM gene or protein, or a 
fragment thereof, will ameliorate osteoporosis. 
IV. Physical Mapping 

To provide reagents for the cloning and characterization of the HBM locus, 
the genetic mapping data described above were used to construct a physical map of 
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the region containing Zmaxl on chromosome 1 lql3.3. The physical map consists 
of an ordered set of molecular landmarks, and a set of BAC clones that contain the 
Zmaxl gene region from chromosome llql3.3. 

Various publicly available mapping resources were utilized to identify 
5 existing STS markers (Olson et al, Science, 245: 1434-1435 (1989)) in the HBM 
region. Resources included the GDB, the Whitehead Institute Genome Center, 
dbSTS and dbEST (NCBI), lldb, the University of Texas Southwestern GESTEC, 
the Stanford Human Genome Center, and several literature references (Courseaux et 
al, Genomics, 40:13-23 (1997), Courseaux et al, Genomics, 37:354-365 (1996), 

10 Guru et al, Genomics, 42:436-445 (1997), Hosoda et al, Genes Cells, 2:345-357 
(1997), James et al, Nat Genet, 8:70-76 (1994), Kitamura et al, DNA Research, 
4:281-289 (1997), Lemmens et al, Genomics, 44:94-100 (1997), Smith et al, 
Genome Res., 7:835-842 (1997)). Maps were integrated manually to identify 
markers mapping to the region containing Zmaxl. 

15 Primers for existing STSs were obtained from the GDB or literature 

references are listed in Table 3 below. Thus, Table 3 shows the STS markers used 
to prepare the physical map of the Zmaxl gene region. 
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Novel STSs were developed either from publicly available genomic 
sequence or from sequence-derived BAC insert ends. Primers were chosen using a 
script which automatically performs vector and repetitive sequence masking using 
Cross_match (P. Green, U. of Washington) and subsequent primer picking using 
5 Primer3 (Rozen, Skaletsky (1996, 1997). Primer3 is available at 
www.genome.wi.mit. edu/genome_software/other/primer3 .html. 

Polymerase chain reaction (PCR) conditions for each primer pair were 
initially optimized with respect to MgCl 2 concentration. The standard buffer was 10 
mM Tris-HCl (pH 8.3), 50 mM KC1, MgCl 2 , 0.2 mM each dNTP, 0.2 nM each 

10 primer, 2.7 ng/^1 human DNA, 0.25 units of AmpliTaq (Perkin Elmer) and MgCl 2 
concentrations of 1 .0 mM, 1.5 mM, 2.0 mM or 2.4 mM. Cycling conditions 
included an initial denaturation at 94°C for 2 minutes followed by 40 cycles at 94°C 
for 15 seconds, 55 °C for 25 seconds, and 72°C for 25 seconds followed by a final 
extension at 72 °C for 3 minutes. Depending on the results from the initial round of 

15 optimization the conditions were further optimized if necessary. Variables included 
increasing the annealing temperature to 58 °C or 60 °C, increasing the cycle number 
to 42 and the annealing and extension times to 30 seconds, and using AmpliTaqGold 
(Perkin Elmer). 

BAC clones (Kim et al, Genomics, 32:213-218 (1996), Shizuya et al, Proc. 

20 Natl Acad. ScL USA, 89:8794-8797 (1992)) containing STS markers of interest 
were obtained by PCR-based screening of DNA pools from a total human BAC 
library purchased from Research Genetics. DNA pools derived from library plates 
1-596 were used corresponding to nine genomic equivalents of human DNA. The 
initial screening process involved PCR reactions of individual markers against 

25 superpools, i.e., a mixture of DNA derived from all BAC clones from eight 384-well 
library plates. For each positive superpool, plate (8), row (16) and column (24) 
pools were screened to identify a unique library address. PCR products were 
electrophoresed in 2% agarose gels (Sigma) containing 0.5 ng/ml ethidium bromide 
in IX TBE at 150 volts for 45 min. The electrophoresis units used were the Model 

30 A3-1 systems from Owl Scientific Products. Typically, gels contained 10 tiers of 
lanes with 50 wells/tier. Molecular weight markers (100 bp ladder, Life 
Technologies, Bethesda, MD) were loaded at both ends of the gel. Images of the 
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gels were captured with a Kodak DC40 CCD camera and processed with Kodak ID 
software. The gel data were exported as tab delimited text files; names of the files 
included information about the library screened, the gel image files and the marker 
screened. These data were automatically imported using a customized Perl script 
5 into Filemaker™ PRO (Claris Corp.) databases for data storage and analysis. In 
cases where incomplete or ambiguous clone address information was obtained, 
additional experiments were performed to recover a unique, complete library 
address. 

Recovery of clonal BAC cultures from the library involved streaking out a 

0 sample from the library well onto LB agar (Maniatis et al, Molecular Cloning: A 
Labor atoiy Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 
(1982)) containing 12.5 |xg/ml chloramphenicol (Sigma). Two individual colonies 
and a portion of the initial streak quadrant were tested with appropriate STS markers 
by colony PCR for verification. Positive clones were stored in LB broth containing 

5 12.5 jag/ml chloramphenicol and 15% glycerol at -70°C. 

Several different types of DNA preparation methods were used for isolation 
of BAC DNA. The manual alkaline lysis miniprep protocol listed below (Maniatis 
et al, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, 
Cold Spring Harbor, NY (1982)) was successfully used for most applications, i.e., 

[) restriction mapping, CHEF gel analysis, FISH mapping, but was not successfully 
reproducible in endsequencing. The Autogen and Qiagen protocols were used 
specifically for BAC DNA preparation for endsequencing purposes. 

Bacteria were grown in 15 ml Terrific Broth containing 12.5 fig/ml 
chloramphenicol in a 50 ml conical tube at 37° C for 20 hrs with shaking at 300 rpm. 

5 The cultures were centrifuged in a Sorvall RT 6000 D at 3000 rpm (-1800 g) at 4°C 
for 15 min. The supernatant was then aspirated as completely as possible. In some 
cases cell pellets were frozen at -20 °C at this step for up to 2 weeks. The pellet was 
then vortexed to homogenize the cells and minimize clumping. 250 \A of PI 
solution (50 mM glucose, 15 mM Tris-HCl, pH 8, 10 mM EDTA, and 100 ^ig/ml 

3 RNase A) was added and the mixture pipetted up and down to mix. The mixture 
was then transferred to a 2 ml Eppendorf tube. 350 |il of P2 solution (0.2 N NaOH, 
1% SDS) was then added, the mixture mixed gently and incubated for 5 min. at 
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room temperature, 350 jllI of P3 solution (3 M KOAc, pH 5.5) was added and the 
mixture mixed gently until a white precipitate formed. The solution was incubated 
on ice for 5 min. and then centrifuged at 4°C in a microfuge for 10 min. The 
supernatant was transferred carefully (avoiding the white precipitate) to a fresh 2 ml 
5 Eppendorf tube, and 0.9 ml of isopropanol was added, the solution mixed and left on 
ice for 5 min. The samples were centrifuged for 10 min., and the supernatant 
removed carefully. Pellets were washed in 70% ethanol and air dried for 5 min. 
Pellets were resuspended in 200 |xl of TE8 (10 mM Tris-HCl, pH 8.0, 1.0 mM 
EDTA), and RNase A (Boehringer Mannheim) added to 100 fig/ml. Samples were 

10 incubated at 37°C for 30 min., then precipitated by addition of C 2 H 3 0 2 Na-3H 2 0 to 
0.5 M and 2 volumes of ethanol. Samples were centrifuged for 10 min., and the 
pellets washed with 70% ethanol followed by air drying and dissolving in 50 fA 
TE8. Typical yields for this DNA prep were 3-5 jag/15 ml bacterial culture. Ten to 
15 |^1 were used for HindHI restriction analysis; 5 pi was used for NotI digestion and 

1 5 clone insert sizing by CHEF gel electrophoresis. 

BACs were inoculated into 15 ml of 2X LB Broth containing 12.5 pg/ml 
chloramphenicol in a 50 ml conical tube. 4 tubes were inoculated for each clone. 
Cultures were grown overnight (-16 hr) at 37°C with vigorous shaking (>300 rpm). 
Standard conditions for B AC DNA isolation were followed as recommended by the 

20 Autogen 740 manufacturer. 3 ml samples of culture were placed into Autogen tubes 
for a total of 60 ml or 20 tubes per clone. Samples were dissolved finally in 100 jil 
TE8 with 15 seconds of shaking as part of the Autogen protocol. After the Autogen 
protocol was finished DNA solutions were transferred from each individual tube and 
pooled into a 2 ml Eppendorf tube. Tubes with large amounts of debris (carry over 

25 from the pelleting debris step) were avoided. The tubes were then rinsed with 0.5 
ml of TE8 successively and this solution added to the pooled material. DNA 
solutions were stored at 4°C; clumping tended to occur upon freezing at -20°C. 
This DNA was either used directly for restriction mapping, CHEF gel analysis or 
FISH mapping or was further purified as described below for use in endsequencing 

30 reactions. 

The volume of DNA solutions was adjusted to 2 ml with TE8, samples were 
then mixed gently and heated at 65 °C for 10 min. The DNA solutions were then 
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centrifuged at 4 °C for 5 min. and the supernatants transferred to a 15 ml conical 
tube. The NaCl concentration was then adjusted to 0.75 M (~0.3 ml of 5 M NaCl to 
the 2 ml sample). The total volume was then adjusted to 6 ml with Qiagen column 
equilibration buffer (Buffer QBT). The supernatant containing the DNA was then 
5 applied to the column and allowed to enter by gravity flow. Columns were washed 
twice with 10 ml of Qiagen Buffer QC. Bound DNA was then eluted with four 
separate 1 ml aliquots of Buffer QF kept at 65 °C. DNA was precipitated with 0.7 
volumes of isopropanol (—2.8 ml). Each sample was then transferred to 4 individual 
2.2 ml Eppendorf tubes and incubated at room temperature for 2 hr or overnight. 

10 Samples were centrifuged in a microfuge for 10 min. at 4°C. The supernatant was 
removed carefully and 1 ml of 70% ethanol was added. Samples were centrifuged 
again and because the DNA pellets were often loose at this stage, the supernatant 
removed carefully. Samples were centrifuged again to concentrate remaining liquid 
which was removed with a micropipet tip. DNA pellets were then dried in a 

15 desiccator for 10 min. 20 |il of sterile distilled and deionized H 2 0 was added to each 
tube which was then placed at 4°C overnight. The four 20 jutl samples for each clone 
were pooled and the tubes rinsed with another 20 \xl of sterile distilled and deionized 
H 2 0 for a final volume of 100 Samples were then heated at 65 °C for 5 min. and 
then mixed gently. Typical yields were 2-5 \xg/60 ml culture as assessed by NotI 

20 digestion and comparison with uncut lambda DNA. 

3 ml of LB Broth containing 12.5 \xg/ml of chloramphenicol was dispensed 
into autoclaved Autogen tubes. A single tube was used for each clone. For 
inoculation, glycerol stocks were removed from -70 °C storage and placed on dry 
ice. A small portion of the glycerol stock was removed from the original tube with a 

25 sterile toothpick and transferred into the Autogen tube; the toothpick was left in the 
Autogen tube for at least two minutes before discarding. After inoculation the tubes 
were covered with tape making sure the seal was tight. When all samples were 
inoculated, the tube units were transferred into an Autogen rack holder and placed 
into a rotary shaker at 37 °C for 16-17 hours at 250 rpm. Following growth, 

30 standard conditions for BAC DNA preparation, as defined by the manufacturer, 

were used to program the Autogen. Samples were not dissolved in TE8 as part of 
the program and DNA pellets were left dry. When the program was complete, the 
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tubes were removed from the output tray and 30 \xl of sterile distilled and deionized 
H 2 0 was added directly to the bottom of the tube. The tubes were then gently 
shaken for 2-5 seconds and then covered with parafilm and incubated at room 
temperature for 1-3 hours. DNA samples were then transferred to an Eppendorf tube 
5 and used either directly for sequencing or stored at 4°C for later use. 
V, BAC Clone Characterization for Physical Mapping 

DNA samples prepared either by manual alkaline lysis or the Autogen 
protocol were digested with Hindlll for analysis of restriction fragment sizes. This 
data were used to compare the extent of overlap among clones. Typically 1-2 \ig 

10 were used for each reaction. Reaction mixtures included: IX Buffer 2 (New 

England Biolabs), 0.1 mg/ml bovine serum albumin (New England Biolabs), 50 
|ng/ml RNase A (Boehringer Mannheim), and 20 units of HindlH (New England 
Biolabs) in a final volume of 25 |xl. Digestions were incubated at 37 °C for 4-6 
hours. BAC DNA was also digested with NotI for estimation of insert size by 

15 CHEF gel analysis (see below). Reaction conditions were identical to those for 

Hindm except that 20 units of NotI were used. Six jitl of 6X Ficoll loading buffer 
containing bromphenol blue and xylene cyanol was added prior to electrophoresis. 

Hindin digests were analyzed on 0.6% agarose (Seakem, FMC Bioproducts) 
in IX TBE containing 0.5 |ng/ml ethidium bromide. Gels (20 cm X 25 cm) were 

20 electrophoresed in a Model A4 electrophoresis unit (Owl Scientific) at 50 volts for 
20-24 hrs. Molecular weight size markers included undigested lambda DNA, 
HindDI digested lambda DNA, and HaeHI digested _X174 DNA. Molecular weight 
markers were heated at 65 °C for 2 min. prior to loading the gel. Images were 
captured with a Kodak DC40 CCD camera and analyzed with Kodak ID software. 

25 NotI digests were analyzed on a CHEF DRII (BioRad) electrophoresis unit 

according to the manufacturer's recommendations. Briefly, 1% agarose gels 
(BioRad pulsed field grade) were prepared in 0.5X TBE, equilibrated for 30 minutes 
in the electrophoresis unit at 14°C, and electrophoresed at 6 volts/cm for 14 hrs with 
circulation. Switching times were ramped from 10 sec to 20 sec. Gels were stained 

30 after electrophoresis in 0.5 |ig/ml ethidium bromide. Molecular weight markers 
included undigested lambda DNA, Hindin digested lambda DNA, lambda ladder 
PFG ladder, and low range PFG marker (all from New England Biolabs). 
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B AC DNA prepared either by the manual alkaline lysis or Autogen protocols 
were labeled for FISH analysis using a Bioprime labeling kit (BioRad) according to 
the manufacturer's recommendation with minor modifications. Approximately 200 
ng of DNA was used for each 50 jal reaction. 3 jul were analyzed on a 2% agarose 
5 gel to determine the extent of labeling. Reactions were purified using a Sephadex 
G50 spin column prior to in situ hybridization. Metaphase FISH was performed as 
described (Ma et al, Cytogenet Cell Genet, 74:266-271 (1996)). 
VI. BAC Endsequencing 

The sequencing of BAC insert ends utilized DNA prepared by either of the 

10 two methods described above. The DYEnamic energy transfer primers and 

Dynamic Direct cycle sequencing kits from Amersham were used for sequencing 
reactions. Ready made sequencing mix including the Ml 3 -40 forward sequencing ; 
primer was used (Catalog # US79730) for the T7 BAC vector terminus; ready made 
sequencing mix (Catalog # US79530) was mixed with the Ml 3 -28 reverse 

1 5 sequencing primer (Catalog # US79339) for the SP6 BAC vector terminus. The 
sequencing reaction mixes included one of the four fluorescently labeled dye- 
primers, one of the four dideoxy termination mixes, dNTPs, reaction buffer, and 
Thermosequenase. For each BAC DNA sample, 3 jil of the BAC DNA sample was 
aliquoted to 4 PCR strip tubes. 2 juil of one of the four dye primer/termination mix 

20 combinations was then added to each of the four tubes. The tubes were then sealed 
and centrifuged briefly prior to PCR. Thermocycling conditions involved a 1 
minute denaturation at 95° C, 15 second annealing at 45 °C, and extension for 1 
minute at 70 °C for 35 total cycles. After cycling the plates were centrifuged briefly 
to collect all the liquid to the bottom of the tubes. 5 fxl of sterile distilled and 

25 deionized H 2 0 was then added into each tube, the plates sealed and centrifuged 

briefly again. The four samples for each BAC were then pooled together. DNA was 
then precipitated by adding 1.5 ^1 of 7.5 M NH 4 OAc and 100 \i\ of -20°C 100% 
ethanol to each tube. Samples were mixed by pipetting up and down once. The 
plates were then sealed and incubated on ice for 10 minutes. Plates were centrifuged 

30 in a table top Haraeus centrifuge at 4000 rpm (3,290 g) for 30 minutes at 4°C to 
recover the DNA. The supernatant was removed and excess liquid blotted onto 
paper towels. Pellets were washed by adding 100 \x\ of -20 °C 70% ethanol into each 
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tube and recentrifuging at 4000 rpm (3,290 g) for 10 minutes at 4°C. The 
supernatant was removed and excess liquid again removed by blotting on a paper 
towel. Remaining traces of liquid were removed by placing the plates upside down 
over a paper towel and centrifuging only until the centrifuge reached 800 rpm. 
5 Samples were then air dried at room temperature for 30 min. Tubes were capped 
and stored dry at -20 °C until electrophoresis. Immediately prior to electrophoresis 
the DNA was dissolved in 1.5 jliI of Amersham loading dye. Plates were then sealed 
and centrifuged at 2000 rpm (825 g). The plates were then vortexed on a plate 
shaker for 1-2 minutes. Samples were then recentrifuged at 2000 rpm (825 g) 
10 briefly. Samples were then heated at 65 °C for 2 min. and immediately placed on 

ice. Standard gel electrophoresis was performed on ABI 377 fluorescent sequencers 

according to the manufacturer's recommendation. 

VII. Sub-cloning and Sequencing of HBM B AC DNA 

The physical map of the Zmaxl gene region provides a set of BAC clones 
15 that contain within them the Zmaxl gene and the HBM gene. DNA sequencing of 
several of the B ACs from the region has been completed. The DNA sequence data 
is a unique reagent that includes data that one skilled in the art can use to identify the 
Zmaxl gene and the HBM gene, or to prepare probes to identify the gene(s), or to 
identify DNA sequence polymorphisms that identify the gene(s). 
20 BAC DNA was isolated according to one of two protocols, either a Qiagen 

purification of BAC DNA (Qiagen, Inc. as described in the product literature) or a 
manual purification which is a modification of the standard alkaline lysis/Cesium 
Chloride preparation of plasmid DNA (see e.g., Ausubel et al, Current Protocols in 
Molecular Biology, John Wiley & Sons (1997)). Briefly for the manual protocol, 
25 cells were pelleted, resuspended in GTE (50 mM glucose, 25 mM Tris-Cl (pH 8), 10 
mM EDTA) and lysozyme (50 mg/ml solution), followed by NaOH/SDS (1% 
SDS/0.2 N NaOH) and then an ice-cold solution of 3 M KOAc (pH 4.5-4.8). 
RnaseA was added to the filtered supernatant, followed by Proteinase K and 20% 
SDS. The DNA was then precipitated with isopropanol, dried and resuspended in 
30 TE (1 0 mM Tris, 1 mM EDTA (pH 8.0)). The BAC DNA was further purified by 
Cesium Chloride density gradient centrifugation (Ausubel et al, Current Protocols 
in Molecular Biology, John Wiley & Sons (1997)). 
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Following isolation, the B AC DNA was sheared hydrodynamically using an 
HPLC (Hengen, Trends in Biochem. ScL, 22:273-274 (1997)) to an insert size of 
2000-3000 bp. After shearing, the DNA was concentrated and separated on a 
standard 1% agarose gel. A single fraction, corresponding to the approximate size, 
5 was excised from the gel and purified by electroelution (Sambrook et al, Molecular 
Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring, NY 
(1989)). 

The purified DNA fragments were then blunt-ended using T4 DNA 
polymerase. The blunt-ended DNA was then ligated to unique BstXI-linker adapters 

10 (5' GTCTTCACCACGGGG and 5 f GTGGTGAAGAC in 100-1000 fold molar 
excess). These linkers were complimentary to the BstXI-cut pMPX vectors 
(constructed by the inventors), while the overhang was not self-complimentary. 
Therefore, the linkers would not concatemerize nor would the cut-vector religate 
itself easily. The linker-adapted inserts were separated from the unincorporated 

15 linkers on a 1% agarose gel and purified using GeneClean (BIO 101, Inc.). The 

linker-adapted insert was then ligated to a modified pBlueScript vector to construct a 
"shotgun" subclone library. The vector contained an out-of-frame lacZ gene at the 
cloning site which became in-frame in the event that an adapter-dimer is cloned, 
allowing these to be avoided by their blue-color. 

20 All subsequent steps were based on sequencing by ABI377 automated DNA 

sequencing methods. Only major modifications to the protocols are highlighted. 
Briefly, the library was then transformed into DH5a competent cells (Life 
Technologies, Bethesda, MD, DH5a transformation protocol). It was assessed by 
plating onto antibiotic plates containing ampicillin and IPTG/Xgal. The plates were 

25 incubated overnight at 37°C. Successful transformants were then used for plating of 
clones and picking for sequencing. The cultures were grown overnight at 37°. 
DNA was purified using a silica bead DNA preparation (Ng et al, Nucl. Acids Res., 
24:5045-5047 (1996)) method. In this manner, 25 jig of DNA was obtained per 
clone. 

30 These purified DNA samples were then sequenced using ABI dye-terrninator 

chemistry. The ABI dye terminator sequence reads were run on ABI377 machines 
and the data was directly transferred to UNIX machines following lane tracking of 
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the gels. All reads were assembled using PHRAP (P. Green, Abstracts of DOE 
Human Genome Program Contractor-Grantee Workshop V, Jan. 1996, p. 157) with 
default parameters and quality scores. The initial assembly was done at 6-fold 
coverage and yielded an average of 8-15 contigs. Following the initial assembly, 
5 missing mates (sequences from clones that only gave one strand reads) were 
identified and sequenced with ABI technology to allow the identification of 
additional overlapping contigs. Primers for walking were selected using a Genome 
Therapeutics program Pick_primer near the ends of the clones to facilitate gap 
closure. These walks were sequenced using the selected clones and primers. Data 

10 were reassembled with PHRAP into sequence contigs. 

VIII. Gene Identification by Computational Methods 

Following assembly of the BAC sequences into contigs, the contigs were 
subjected to computational analyses to identify coding regions and regions bearing 
DNA sequence similarity to known genes. This protocol included the following 

15 steps. 

1 . Degap the contigs: the sequence contigs often contain symbols 
(denoted by a period symbol) that represent locations where the individual ABI 
sequence reads have insertions or deletions. Prior to automated computational 
analysis of the contigs, the periods were removed. The original data was maintained 

20 for future reference. 

2. BAC vector sequences were "masked" within the sequence by using 
the program cross match (Phil Green, 

http:\\chimera.biotech.washington.edu\UWGC). Since the shotgun libraries 
construction detailed above leaves some BAC vector in the shotgun libraries, this 
25 program was used to compare the sequence of the BAC contigs to the BAC vector 
and to mask any vector sequence prior to subsequent steps. Masked sequences were 
marked by an "X" in the sequence files, and remained inert during subsequent 
analyses. 

3. E. coli sequences contaminating the BAC sequences were masked by 
30 comparing the BAC contigs to the entire E. coli DNA sequence. 

4. Repetitive elements known to be common in the human genome were 
masked using cross match. In this implementation of crossmatch, the BAC 
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sequence was compared to a database of human repetitive elements (Jerzy Jerka, 
Genetic Information Research Institute, Palo Alto, CA). The masked repeats were 
marked by X and remained inert during subsequent analyses. 

5. The location of exons within the sequence was predicted using the 
5 MZEF computer program (Zhang, Proc. Natl Acad. Sci., 94:565-568 (1997)). 

6. The sequence was compared to the publicly available unigene 
database (National Center for Biotechnology Information, National Library of 
Medicine, 38A, 8N905, 8600 Rockville Pike, Bethesda, MD 20894; 
www.ncbi.nlm.nih.gov) using the blastn2 algorithm (Altschul et al, Nucl Acids Res., 

10 25:3389-3402(1997)). The parameters for this search were: E=0.05, v=50, B=50 
(where E is the expected probability score cutoff, V is the number of database 
entries returned in the reporting of the results, and B is the number of sequence 
alignments returned in the reporting of the results (Altschul et al, J. Mol Biol, 
215:403-410(1990)). 

15 7. The sequence was translated into protein for all six reading frames, 

and the protein sequences were compared to a non-redundant protein database 
compiled from Genpept Swissprot PIR (National Center for Biotechnology 
Information, National Library of Medicine, 3 8 A, 8N905, 8600 Rockville Pike, 
Bethesda, MD 20894; www.ncbi.nlm.nih.gov). The parameters for this search were 

20 E=0.05, V=50, B= 50, where E, V, and B are defined as above. 

8. The BAC DNA sequence was compared to the database of the cDNA 
clones derived from direct selection experiments (described below) using blastn2 
(Altschul et al, Nucl Acids. Res., 25:3389-3402 (1997)). The parameters for this 
search were E=0.05, ¥==250, B=250, where E, V, and B are defined as above. 

25 9. The BAC sequence was compared to the sequences of all other BACs 

from the HBM region on chromosome 1 lql2-13 using blastn2 (Altschul et al, Nucl 
Acids. Res., 25:3389-3402 (1997)). The parameters for this search were E=0.05, 
V=50, B=50, where E, V, and B are defined as above. 

10. The BAC sequence was compared to the sequences derived from the 

30 ends of BACs from the HBM region on chromosome 1 lql2-13 using blastn2 

(Altschul et al, Nucl. Acids. Res., 25:3389-3402 (1997)). The parameters for this 
search were E=0.05, V=50, B=50, where E, V, and B are defined as above. 
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11. The BAC sequence was compared to the Genbank database (National 
Center for Biotechnology Information, National Library of Medicine, 3SA, 8N905, 
8600 Rockville Pike, Bethesda, MD 20894; www.ncbi.nlm.nih.gov) using blastn2 
(Altschul et al, NucL Acids. Res., 25:3389-3402 (1997)). The parameters for this 

5 search were E=0.05, V=50, B=50, where E, V, and B are defined as above. 

12. The BAC sequence was compared to the STS division of Genbank 
database (National Center for Biotechnology Information, National Library of 
Medicine, 38A, 8N905, 8600 Rockville Pike, Bethesda, MD 20894; 
www.ncbi.nlm.nih.gov) using blastn2 (Altschul et al, 1997). The parameters for this 

10 search were E=0.05, V=50, B= 50, where E, V, and B are defined as above. 

13. The BAC sequence was compared to the Expressed Sequence (EST) 
Tag Genbank database (National Center for Biotechnology Information, National 
Library of Medicine, 3 8 A, 8N905, 8600 Rockville Pike, Bethesda, MD 20894; 
www.ncbi.nlm.nih.gov) using blastn2 (Altschul et al, NucL Acids. Res., 25:3389- 

15 3402 (1997)). The parameters for this search were E=0.05, V=250, B=250, where E, 
V, and B are defined as above. 

IX. Gene Identification by Direct cDNA Selection 

Primary linkered cDNA pools were prepared from bone marrow, calvarial 
bone, femoral bone, kidney, skeletal muscle, testis and total brain. Poly (A) + RNA 

20 was prepared from calvarial and femoral bone tissue (Chomczynski et al, Anal 
Biochem., 162:156-159 (1987); D'Alessio et al, Focus, 9:1-4 (1987)) and the 
remainder of the mRNA was purchased from Clontech (Palo Alto, California). In 
order to generate oligo(dT) and random primed cDNA pools from the same tissue, 
2.5 ^ig mRNA was mixed with oligo(dT) primer in one reaction and 2.5 jug mRNA 

25 was mixed with random hexamers in another reaction, and both were converted to 
first and second strand cDNA according to manufacturers recommendations (Life 
Technologies, Bethesda, MD). Paired phosphorylated cDNA linkers (see sequence 
below) were annealed together by mixing in a 1:1 ratio (10 ng each) incubated at 
65 °C for five minutes and allowed to cool to room temperature. 

30 Paired linkers oligol/2 

OLIGO 1 : 5'CTG AGC GGA ATT CGT GAG ACC3 1 (SEQ ID NO: 12) 
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OLIGO 2: 5'TTG GTC TCA CGT ATT CCG CTC GA3' (SEQ ID NO: 13) 

Paired linkers oligo3/4 

OLIGO 3: 5 f CTC GAG AAT TCT GGA TCC TC3' (SEQ ID NO:14) 
OLIGO 4: 5'TTG AGG ATC CAG AAT TCT CGA G3' (SEQ ID NO: 1 5) 

Paired linkers oligo5/6 

OLIGO 5: 5TGT ATG CGA ATT CGC TGC GCG3 1 (SEQ ID NO:16) 
OLIGO 6: 5TTC GCG CAG CGA ATT CGC ATA CA3' (SEQ ID NO:17) 

Paired linkers oligo7/8 

OLIGO 7: 5'GTC CAC TGA ATT CTC AGT GAG3' (SEQ ID NO: 1 8) 
OLIGO 8: 5TTG TCA CTG AGA ATT CAG TGG AC3 f (SEQ ID NO: 19) 

Paired linkers oligol 1/12 

OLIGO 1 1 : 5 f GAA TCC GAA TTC CTG GTC AGC3 1 (SEQ ED NO:20) 
OLIGO 12: 5'TTG CTG ACC AGG AAT TCG GAT TC3' (SEQ ID NO:21) 
Linkers were ligated to all oligo(dT) and random primed cDNA pools (see below) 
according to manufacturers instructions (Life Technologies, Bethesda, MD). 

Oligo 1/2 was ligated to oligo(dT) and random primed cDNA pools prepared 
from bone marrow. Oligo 3/4 was ligated to oligo(dT) and random primed cDNA 
pools prepared from calvarial bone. Oligo 5/6 was ligated to oligo(dT) and random 
primed cDNA pools prepared from brain and skeletal muscle. Oligo 7/8 was ligated 
to oligo(dT) and random primed cDNA pools prepared from kidney. Oligo 11/12 
was ligated to oligo(dT) and random primed cDNA pools prepared from femoral 
bone. 

The cDNA pools were evaluated for length distribution by PCR 
amplification using 1 pi of a 1:1, 1:10, and 1:100 dilution of the ligation reaction, 
respectively. PCR reactions were performed in a Perkiu Elmer 9600, each 25 pi 
volume reaction contained 1 pi of DNA, 10 mM Tris-HCl (pH 8.3), 50 mM KC1, 1.5 
mM MgC12, 0.001% gelatin, 200 mM each dNTPs, 10 pM primer and 1 unit Taq 
DNA polymerase (Perkin Elmer) and was amplified under the following conditions: 
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30 seconds at 94°C, 30 seconds at 60°C and 2 minutes at 72°C for 30 cycles. The 
length distribution of the amplified cDNA pools were evaluated by electrophoresis 
on a 1% agarose gel. The PCR reaction that gave the best representation of the 
random primed and oligo(dT) primed cDNA pools was scaled up so that -2-3 [ig of 
5 each cDNA pool was produced. The starting cDNA for the direct selection reaction 
comprised of 0.5 jug of random primed cDNAs mixed with 0.5 jig of oligo(dT) 
primed cDNAs. 

The DNA from the 54 BACs that were used in the direct cDNA selection 
procedure was isolated using Nucleobond AX columns as described by the 
10 manufacturer (The Nest Group, Inc.). 

The BACs were pooled in equimolar amounts and 1 fj,g of the isolated 
genomic DNA was labeled with biotin 16-UTP by nick translation in accordance 
with the manufacturers instructions (Boehtinger Mannheim). The incorporation of 
the biotin was monitored by methods that could be practiced by one skilled in the art 
1 5 (Del Mastro and Lovett, Methods in Molecular Biology, Humana Press Inc., NJ 
(1996)). 

Direct cDNA selection was performed using methods that could be practiced 
by one skilled in the art (Del Mastro and Lovett, Methods in Molecular Biology, 
Humana Press Inc., NJ (1996)). Briefly, the cDNA pools were multiplexed in two 

20 separate reactions: In one reaction cDNA pools from bone marrow, calvarial bone, 
brain and testis were mixed, and in the other cDNA pools from skeletal muscle, 
kidney and femoral bone were mixed. Suppression of the repeats, yeast sequences 
and plasmid in the cDNA pools was performed to a Cot of 20. 100 ng of 
biotinylated BAC DNA was mixed with the suppressed cDNAs and hybridized in 

25 solution to a Cot of 200. The biotinylated DNA and the cognate cDNAs was 

captured on streptavidin-coated paramagnetic beads. The beads were washed and the 
primary selected cDNAs were eluted. These cDNAs were PCR amplified and a 
second round of direct selection was performed. The product of the second round of 
direct selection is referred to as the secondary selected material. A Galanin cDNA 

30 clone, previously shown to map to 1 lql2-13 (Evans, Genomics, 18:473-477 (1993)), 
was used to monitor enrichment during the two rounds of selection. 
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The secondary selected material from bone marrow, calvarial bone, femoral 
bone, kidney, skeletal muscle, testis and total brain was PCR amplified using 
modified primers of oligos 1, 3, 5, 7 and 1 1, shown below, and cloned into the UDG 
vector pAMPIO (Life Technologies, Bethesda, MD), in accordance with the 
manufacturer's recommendations. 
Modified primer sequences: 

Oligol-CUA: 5'CUA CUA CUA CUA CTG AGC GGA ATT CGT GAG ACC3 1 
(SEQ ID NO:22) 

OHgo3-CUA: 5 ? CUA CUA CUA CUA CTC GAG AAT TCT GGA TCC TC3' 
(SEQ ID NO:23) 

OligoS-CUA: 5 ! CUA CUA CUA CUA TGT ATG CGA ATT CGC TGC GCG3' 
(SEQ ID NO:24) 

OHgo7-CUA: 5 f CUA CUA CUA CUA GTC CAC TGA ATT CTC AGT GAG3 1 
(SEQ ID NO:25) 

Oligol 1-CUA: 5'CUA CUA CUA CUA GAA TCC GAA TTC CTG GTC AGC3 1 
(SEQ ID NO:26) 

The cloned secondary selected material, from each tissue source, was 
transformed into MAX Efficiency DH5a Competent Cells (Life Technologies, 
Bethesda, MD) as recommended by the manufacturer. 384 colonies were picked 
from each transformed source and arrayed into four 96 well microtiter plates. 
All secondarily selected cDNA clones were sequenced using Ml 3 dye primer 
terminator cycle sequencing kit (Applied Biosystems), and the data collected by the 
ABI 377 automated fluorescence sequencer (Applied Biosystems). 
All sequences were analyzed using the BLASTN, BLASTX and FASTA programs 
(Altschul et al, J. Mol Biol, 215:403-410 (1990), Altschul et al, Nucl. Acids. Res., 
25:3389-3402 (1997)). The cDNA sequences were compared to a database 
containing sequences derived from human repeats, mitochondrial DNA, ribosomal 
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RNA, E. coli DNA to remove background clones from the dataset using the program 
crossjnatch. A further round of comparison was also performed using the program 
BLASTN2 against known genes (Genbank) and the BAG sequences from the HBM 
region. Those cDNAs that were >90% homologous to these sequences were filed 
5 according to the result and the data stored in a database for further analysis. cDNA 
sequences that were identified but did not have significant similarity to the BAC 
sequences from the HBM region or were eliminated by cross match were hybridized 
to nylon membranes which contained the BACs from the HBM region, to ascertain 
whether they hybridized to the target. 

10 Hybridization analysis was used to map the cDNA clones to the BAC target 

that selected them. The BACs that were identified from the HBM region were 
arrayed and grown into a 96 well microtiter plate. LB agar containing 25 |xg/ml 
kanamycin was poured into 96 well microtiter plate lids. Once the agar had 
solidified, pre-cut Hybond N+ nylon membranes (Amersham) were laid on top of 

15 the agar and the BACs were stamped onto the membranes in duplicate using a hand 
held 96 well replica plater (V&P Scientific, Inc.). The plates were incubated 
overnight at 37°C. The membranes were processed according to the manufacturers 
recommendations. 

The cDNAs that needed to be mapped by hybridization were PCR amplified 
20 using the relevant primer (oligos 1,3,5,7 and 1 1) that would amplify that clone. 
For this PCR amplification, the primers were modified to contain a linkered 
digoxigenin molecule at the 5 f of the oligonucleotide. The PCR amplification was 
performed under the same conditions as described in Preparation of cDNA Pools 
(above). The PCR products were evaluated for quality and quantity by 
25 electrophoresis on a 1 % agarose gel by loading 5 jlcI of the PCR reaction. The nylon 
membranes containing the stamped BACs were individually pre-hybridized in 50 ml 
conical tubes containing 10 ml of hybridization solution (5x SSPE, 0.5x Blotto, 
2.5% SDS and 1 mM EDTA (pH 8.0)). The 50 ml conical tubes were placed in a 
rotisserie oven (Robbins Scientific) for 2 hours at 65 °C. Twenty-five ng of each 
30 cDNA probe was denatured and added into individual 50 ml conical tubes 

containing the nylon membrane and hybridization solution. The hybridization was 
performed overnight at 65 °C. The filters were washed for 20 minutes at 65 °C in 
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each of the following solutions: 3x SSPE, 0.1% SDS; lx SSPE, 0.1% SDS and O.lx 
SSPE, 0.1% SDS. 

The membranes were removed from the 50 ml conical tubes and placed in a 
dish. Acetate sheets were placed between each membrane to prevent them from 
5 sticking to each other. The incubation of the membranes with the Anti-DIG-AP and 
CDP-Star was performed according to manufacturers recommendations (Boehringer 
Mannheim). The membranes were wrapped in Saran wrap and exposed to Kodak 
Bio-Max X-ray film for 1 hour. 
X. cDNA Cloning and Expression Analysis 

10 To characterize the expression of the genes identified by direct cDNA 

selection and genomic DNA sequencing in comparison to the publicly available 
databases, a series of experiments were performed to further characterize the genes 
in the HBM region. First, oligonucleotide primers were designed for use in the 
polymerase chain reaction (PCR) so that portions of a cDNA, EST, or genomic 

1 5 DNA could be amplified from a pool of DNA molecules (a cDNA library) or RNA 
population (RT-PCR and RACE). The PCR primers were used in a reaction 
containing genomic DNA to verify that they generated a product of the size 
predicted based on the genomic (BAC) sequence. A number of cDNA libraries were - 
then examined for the presence of the specific cDNA or EST. The presence of a 

20 fragment of a transcription unit in a particular cDNA library indicates a high 

probability that additional portions of the same transcription unit will be present as 
well. 

A critical piece of data that is required when characterizing novel genes is 
the length, in nucleotides, of the processed transcript or messenger RNA (mRNA). 

25 One skilled in the art primarily determines the length of an mRNA by Northern blot 
hybridization (Sambrook et al, Molecular Cloning: A Laboratory Manual, Cold 
Spring Harbor Laboratory, Cold Spring Harbor NY (1989)). Groups of ESTs and 
direct-selected cDNA clones that displayed significant sequence similarity to 
sequenced BACs in the critical region were grouped for convenience into 

30 approximately 30 kilobase units. Within each 30 kilobase unit there were from one 
up to fifty ESTs and direct-selected cDNA clones which comprised one or more 
independent transcription units. One or more ESTs or direct-selected cDNAs were 
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used as hybridization probes to determine the length of the mRNA in a variety of 
tissues, using commercially available reagents (Multiple Tissue Northern blot; 
Clontech, Palo Alto, California) under conditions recommended by the 
manufacturer. 

Directionally cloned cDNA libraries from femoral bone, and calvarial bone 
tissue were constructed by methods familiar to one skilled in the art (for example, 
Soares in Automated DNA Sequencing and Analysis, Adams, Fields and Venter, 
Eds., Academic Press, NY, pages 1 10-114 (1994)). Bones were initially broken into 
fragments with a hammer, and the small pieces were frozen in liquid nitrogen and 
reduced to a powder in a tissue pulverizer (Spectrum Laboratory Products). RNA 
was extracted from the powdered bone by homogenizing the powdered bone 
with a standard Acid Guanidinium Thiocyanate-Phenol-Chloroform extraction 
buffer (e.g. Chomczynski and Szicchi, Anal. Biochenu, 162:156-159 (1987)) using a 
polytron homogenizer (Brinkman Instruments). Additionally, human brain and lung 
total RNA was purchased from Clontech. PolyA RNA was isolated from total RNA 
using dynabeads-dT according to the manufacturer's recommendations (Dynal, Inc.). 

First strand cDNA synthesis was initiated using an oligonucleotide primer 




TTTTTTTTT-3' (SEQ ID NO:27). This primer introduces a NotI restriction site 
(underlined) at the 3* end of the cDNA. First and second strand synthesis were 
performed using the "one-tube" cDNA synthesis kit as described by the anufacturer 
(Life Technologies, Bethesda, MD). Double stranded cDNAs were treated with T4 
polynucleotide kinase to ensure that the ends of the molecules were blunt (Soares in 
Automated DNA Sequencing and Analysis, Adams, Fields and Venter, Eds., 
Academic Press, NY, pages 110-1 14 (1994)), and the blunt ended cDNAs were then 
size selected by a Biogel column (Huynh et al in DNA Cloning, Vol. 1, Glover, Ed., 
IRL Press, Oxford, pages 49-78 (1985)) or with a size-sep 400 sepharose column 
(Pharmacia, catalog # 27-5105-01). Only cDNAs of 400 base pairs or longer were 
used in subsequent steps. EcoRI adapters (sequence: 5 1 

OH-AATTCGGCACGAG-OH 3 f (SEQ ED NO:28), and 5' p-CTCGTGCCG-OH 3' 
(SEQ ID NO:29)) were then ligated to the double stranded cDNAs by methods 
familiar to one skilled in the art (Soares, 1994). The EcoRI adapters were then 




BNSDOCID: <WO. 



0177327A1_L> 



WO 01/77327 



PCT/US00/16951 



-55- 

removed from the 3' end of the cDNA by digestion with NotI (Soares, 1994). The 
cDNA was then ligated into the plasmid vector pBluescript II KS+ (Stratagene, La 
Jolla, California), and the ligated material was transformed into E. coli host DH10B 
or DH12S by electroporation methods familiar to one skilled in the art (Soares, 
5 1994). After growth overnight at 37°C, DNA was recovered from the E. coli 

colonies after scraping the plates by processing as directed for the Mega-prep kit 
(Qiagen, Chatsworth, California). The quality of the cDNA libraries was estimated 
by counting a portion of the total numbers of primary transform ants and determining 
the average insert size and the percentage of plasmids with no cDNA insert. 

10 Additional cDNA libraries (human total brain, heart, kidney, leukocyte, and fetal 
brain) were purchased from Life Technologies, Bethesda, MD. 

cDNA libraries, both oligo (dT) and random hexamer (N 6 ) primed, were used 
for isolating cDNA clones transcribed within the HBM region: human bone, human 
brain, human kidney and human skeletal muscle (all cDNA libraries were made by 

15 the inventors, except for skeletal muscle (dT) and kidney (dT) cDNA libraries). 
Four 10x10 arrays of each of the cDNA libraries were prepared as follows: the 
cDNA libraries were titered to 2.5 x 10 6 using primary transformants. The 
appropriate volume of frozen stock was used to inoculate 2 L of LB/ampicillin (100 
mg/ml). This inoculated liquid culture was aliquotted into 400 tubes of 4 ml each. 

20 Each tube contained approximately 5000 cfu. The tubes were incubated at 30°C 
overnight with gentle agitation. The cultures were grown to an OD of 0.7-0.9. 
Frozen stocks were prepared for each of the cultures by aliquotting 100 jjI of culture 
and 300 jil of 80% glycerol. Stocks were frozen in a dry ice/ethanol bath and stored 
at -70 °C. The remaining culture was DNA prepared using the Qiagen (Chatsworth, 

25 CA) spin miniprep kit according to the manufacturer's instructions. The DNAs from 
the 400 cultures were pooled to make 80 column and row pools. The cDNA 
libraries were determined to contain HBM cDNA clones of interest by PCR. 
Markers were designed to amplify putative exons. Once a standard PCR 
optimization was performed and specific cDNA libraries were determined to contain 

30 cDNA clones of interest, the markers were used to screen the arrayed library. 

Positive addresses indicating the presence of cDNA clones were confirmed by a 
second PCR using the same markers. 
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Once a cDNA library was identified as likely to contain cDNA clones 
corresponding to a specific transcript of interest from the HBM region, it was 
manipulated to isolate the clone or clones containing cDNA inserts identical to the 
EST or direct-selected cDNA of interest. This was accomplished by a modification 
of the standard "colony screening 1 ' method (Sambrook et al, Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor NY 
(1989)). Specifically, twenty 150 mm LB+ampicillin agar plates were spread with 
20,000 colony forming units (cfu) of cDNA library and the colonies allowed to grow 
overnight at 37 °C. Colonies were transferred to nylon filters (Hybond from 
Amersham, or equivalent) and duplicates prepared by pressing two filters together 
essentially as described (Sambrook et al, Molecular Cloning: A Laboratory Manual, 
Cold Spring Harbor Laboratory, Cold Spring Harbor NY (1989)). The "master" 
plate was then incubated an additional 6-8 hours to allow the colonies to grow back. 
The DNA from the bacterial colonies was then affixed to the nylon filters by treating 
the filters sequentially with denaturing solution (0.5 N NaOH, 1.5 M NaCl) for two 
minutes, neutralization solution (0.5 M Tris-Cl pH 8.0, 1 .5 M NaCl) for two minutes 
(twice). The bacterial colonies were removed from the filters by washing in a 
solution of 2X SSC/0. 1% SDS for one minute while rubbing with tissue paper. The 
filters were air dried and baked under vacuum at 80 °C for 1-2 hours. 

A cDNA hybridization probe was prepared by random hexamer labeling 
(Fineberg and Vogelstein, Anal Biochem,, 132:6-13 (1983)) or by including gene- 
specific primers and no random hexamers in the reaction (for small fragments). 
Specific activity was calculated and was >5X10 8 cpm/10 8 \ig of cDNA. The colony 
membranes were then prewashed in 10 mM Tris-Cl pH 8.0, 1 M NaCl, 1 mM 
EDTA, 0.1% SDS for 30 minutes at 55 °C. Following the prewash, the filters were 
prehybridized in > 2 ml/filter of 6X SSC, 50 % deionized formamide, 2% SDS, 5X 
Denhardt's solution, and 100 mg/ml denatured salmon sperm DNA, at 42 °C for 30 
minutes. The filters were then transferred to hybridization solution (6X SSC, 2% 
SDS, 5X Denhardt's, 100 mg/ml denatured salmon sperm DNA) containing 
denatured a 32 P-dCTP-labeled cDNA probe and incubated at 42°C for 16-18 hours. 

After the 16-18 hour incubation, the filters were washed under constant 
agitation in 2X SSC, 2% SDS at room temperature for 20 minutes, followed by two 



WO 01/77327 



PCT/US00/16951 



-57- 

washes at 65 °C for 15 minutes each. A second wash was performed in 0.5 X SSC, 
0.5% SDS for 15 minutes at 65 °C. Filters were then wrapped in plastic wrap and 
exposed to radiographic film for several hours to overnight. After film 
development, individual colonies on plates were aligned with the autoradiograph so 
5 that they could be picked into a 1 ml solution of LB Broth containing ampicillin. 

After shaking at 37 °C for 1-2 hours, aliquots of the solution were plated on 150 mm 
plates for secondary screening. Secondary screening was identical to primary 
screening (above) except that it was performed on plates containing -250 colonies 
so that individual colonies could be clearly identified for picking. 

10 After colony screening with radiolabeled probes yielded cDNA clones, the 

clones were characterized by restriction endonuclease cleavage, PCR, and direct 
sequencing to confirm the sequence identity between the original probe and the - 
isolated clone. To obtain the full-length cDNA, the novel sequence from the end of 
the clone identified was used to probe the library again. This process was repeated 

1 5 until the length of the cDNA cloned matches that estimated to be full-length by the 
northern blot analysis. 

RT-PCR was used as another method to isolate full length clones. The 
cDNA was synthesized and amplified using a "Superscript One Step RT-PCR" kit 
(Life Technologies, Gaithersburg, MD). The procedure involved adding 1.5 jig of 

20 RNA to the following: 25 \il of reaction mix provided which is a proprietary buffer 
mix with MgS0 4 and dNTP's, 1 \il sense primer (10 jjM) and 1 ^1 anti-sense primer 
(10 jiM), 1 jol reverse transcriptase and Taq DNA polymerase mix provided and 
autoclaved water to a total reaction mix of 50 jil. The reaction was then placed in a 
thermocycler for 1 cycle at 50°C for 15 to 30 minutes, then 94°C for 15 seconds, 

25 55-60°C for 30 seconds and 68-72°C for 1 minute per kilobase of anticipated 

product and finally 1 cycle of 72 °C for 5-10 minutes. The sample was analyzed on 
an agarose gel. The product was excised from the gel and purified from the gel 
(GeneClean, Bio 101). The purified product was cloned in pCTNR (General 
Contractor DNA Cloning System, 5 Prime - 3 Prime, Inc.) and sequenced to verify 

30 that the clone was specific to the gene of interest. 

Rapid Amplification of cDNA ends (RACE) was performed following the 
manufacturer's instructions using a Marathon cDNA Amplification Kit (Clontech, 
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Palo Alto, CA) as a method for cloning the 5' and 3' ends of candidate genes. cDNA 
pools were prepared from total RNA by performing first strand synthesis, where a 
sample of total RNA sample was mixed with a modified oligo (dT) primer, heated to 
70 °C, cooled on ice and followed by the addition of: 5x first strand buffer, 10 mM 
5 dNTP mix, and AMV Reverse Transcriptase (20 U/|il). The tube was incubated at 
42 °C for one hour and then the reaction tube was placed on ice. For second strand 
synthesis, the following components were added directly to the reaction tube: 5x 
second strand buffer, 10 mM dNTP mix, sterile water, 20x second strand enzyme 
cocktail and the reaction tube was incubated at 16°C for 1.5 hours. T4 DNA 

10 Polymerase was added to the reaction tube and incubated at 16°C for 45 minutes. 
The second-strand synthesis was terminated with the addition of an 
EDTA/Glycogen mix. The sample was subjected to a phenol/chloroform extraction 
and an ammonium acetate precipitation. The cDNA pools were checked for quality 
by analyzing on an agarose gel for size distribution. Marathon cDNA adapters 

15 (Clontech) were then ligated onto the cDNA ends. The specific adapters contained 
priming sites that allowed for amplification of either 5* or 3 f ends, depending on the 
orientation of the gene specific primer (GSP) that was chosen. An aliquot of the 
double stranded cDNA was added to the following reagents: 10 pM Marathon 
cDNA adapter, 5x DNA ligation buffer, T4 DNA ligase. The reaction was incubated 

20 at 16 °C overnight. The reaction was heat inactivated to terminate the reaction. PCR 
was performed by the addition of the following to the diluted double stranded cDNA 
pool: lOx cDNA PCR reaction buffer, 10 ]uM dNTP mix, 10 jiM GSP, 10 yM API 
primer (kit), 5 Ox Advantage cDNA Polymerase Mix. Thermal Cycling conditions 
were 94°C for 30 seconds, 5 cycles of 94°C for 5 seconds, 72°C for 4 minutes, 5 

25 cycles of 94°C for 5 seconds, 70°C for 4 minutes, 23 cycles of 94°C for 5 seconds, 
68 °C for 4 minutes. After the first round of PCR was performed using the GSP to 
extend to the end of the adapter to create the adapter primer binding site, exponential 
amplification of the specific cDNA of interest was observed. Usually a second 
nested PCR is performed to confirm the specific cDNA. The RACE product was 

30 analyzed on an agarose gel and then excised and purified from the gel (GeneClean, 
BIO 101). The RACE product was then cloned into pCTNR (General Contractor 
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DNA Cloning System, 5' - 3', Inc.) and the DNA sequence determined to verify that 
the clone is specific to the gene of interest. 
XI. Mutation Analysis 

Comparative genes were identified using the above procedures and the exons 
5 from each gene were subjected to mutation detection analysis. Comparative DNA 
sequencing was used to identify polymorphisms in HBM candidate genes from 
chromosome llql2-13. DNA sequences for candidate genes were amplified from 
patient lymphoblastoid cell lines. 

The inventors developed a method based on analysis of direct DNA 

10 sequencing of PCR products amplified from candidate regions to search for the 

causative polymorphism. The procedure consisted of three stages that used different 
subsets of HBM family to find segregating polymorphisms and a population panel to :• 
assess the frequency of the polymorphisms. The family resources result from a 
single founder leading to the assumption that all affected individuals will share the 

15 same causative polymorphism. 

Candidate regions were first screened in a subset of the HBM family 
consisting of the proband, daughter, and her mother, father and brother. 
Monochromosomal reference sequences were produced concurrently and used for 
comparison. The mother and daughter carried the HBM polymorphism in this 

20 nuclear family, providing the ability to monitor polymorphism transmission. The 
net result is that two HBM chromosomes and six non-HBM chromosomes were 
screened. This allowed exclusion of numerous frequent alleles. Only alleles 
exclusively present in the affected individuals passed to the next level of analysis. 

Polymorphisms that segregated exclusively with the HBM phenotype in this 

25 original family were then re-examined in an extended portion of the HBM pedigree 
consisting of two additional nuclear families. These families consisted of five HBM 
and three unaffected individuals. The HBM individuals in this group included the 
two critical crossover individuals, providing the centromeric and telomeric 
boundaries of the critical region. Tracking the heredity of polymorphisms between 

30 these individuals and their affected parents allowed for further refining of the critical 
region. This group brought the total of HBM chromosomes screened to seven and 
the total of non-HBM chromosomes to seventeen. 
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When a given polymorphism continued to segregate exclusively with the 
HBM phenotype in the extended group, a population panel was then examined. This 
panel of 84 persons consisted of 42 individuals known to have normal bone mineral 
density and 42 individuals known to be unrelated but with untyped bone mineral 
5 density. Normal bone mineral density is within two standard deviations of BMD Z 
score 0. The second group was from the widely used CEPH panel of individuals. 
Any segregating polymorphisms found to be rare in this population were 
subsequently examined on the entire HBM pedigree and a larger population. 

Polymerase chain reaction (PCR) was used to generate sequencing templates 

10 from the HBM family ! s DNA and monochromosomal controls. Enzymatic 

amplification of genes within the HBM region on 1 lql2-13 was accomplished using 
the PCR with oligonucleotides flanking each exon as well as the putative 5 f 
regulatory elements of each gene. The primers were chosen to amplify each exon as 
well as 15 or more base pairs within each intron on either side of the splice. All 

15 PCR primers were made as chimeras to facilitate dye primer sequencing. The Ml 3- 
2 IF (5 f - GTA A CGA CGG CCA GT -3') (SEQ ID NO:30) and -28REV (5 T - AAC 
AGC TAT GAC CAT G -3') (SEQ ID NO:31) primer binding sites were built on to 
the 5' end of each forward and reverse PCR primer, respectively, during synthesis. 
150 ng of genomic DNA was used in a 50 |nl PCR with 2 U AmpliTaq, 500 nM 

20 primer and 125 [iM dNTP. Buffer and cycling conditions were specific to each 
primer set. TaqStart antibody (Clontech) was used for hot start PCR to minimize 
primer dimer formation. 10% of the product was examined on an agarose gel. The 
appropriate samples were diluted 1 :25 with deionized water before sequencing. 

Each PCR product was sequenced according to the standard Energy Transfer 

25 primer (Amersham) protocol. All reactions took place in 96 well trays. 4 separate 
reactions, one each for A, C, G and T were performed for each template. Each 
reaction included 2 fxl of the sequencing reaction mix and 3 jlxI of diluted template. 
The plates were then heat sealed with foil tape and placed in a thermal cycler and 
cycled according to the manufacturer's recommendation. After cycling, the 4 

30 reactions were pooled. 3 jal of the pooled product was transferred to a new 96 well 
plate and 1 jil of the manufacturer's loading dye was added to each well. All 96 well 
pipetting procedures occurred on a Hydra 96 pipetting station (Robbins Scientific, 
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USA). 1 jil of pooled material was directly loaded onto a 48 lane gel running on an 
ABI 377 DNA sequencer for a 10 hour, 2.4 kV run. 

Polyphred (University of Washington) was used to assemble sequence sets 
for viewing with Consed (University of Washington). Sequences were assembled in 

5 groups representing all relevant family members and controls for a specified target 
region. This was done separately for each of the three stages. Forward and reverse 
reads were included for each individual along with reads from the 
monochromosomal templates and a color annotated reference sequence. Polyphred 
indicated potential polymorphic sites with a purple flag. Two readers independently 

0 viewed each assembly and assessed the validity of the purple- flagged sites. 

A total of 23 exons present in the mature mRNA and several other portions 
of the primary transcript were evaluated for hetero2ygosity in the nuclear family of 
two HBM-affected and two unaffected individuals. 25 SNPs were identified, as 
shown in the table below. 



TABLE 4: Single Nucleotide Polymorphisms in the Zmaxl Gene and Environs 



Exon Name 


Location 


Base Change 


b200e21-h_Contigl_l .nt 


69169 (309G) 


C/A 


b200e2 1 -h_Contig4_l 2 .nt 


27402 (309G) 


AJG 


b200e2 l-h_Contig4_13.nt 


27841 (309G) 


T/C 


b200e2 1 -h_Contig4_l 6.nt 


35600 (309G) 


AJG 


b200e2 1 -h_Contig4_2 1 .nt 


45619 (309G) 


G/A 


b200e2 1 -h_Contig4_22.nt-a 


46018 (309G) 


T/G 1 


b200e2 1 -h_Contig4_22.nt-b 


46093 (309G) 


T/G 


b200e2 1 -h_Contig4_22.nt-c 


46190 (309G) 


A/G 


b200e2 1 -h_Contig4_24.nt-a 


50993 (309G) 


T/C 


b200e2 1 -h_Contig4_24.nt-b 


51124 (309G) 


C/T 


b200e2 1 -h_Contig4_25.nt 


55461 (309G) 


C/T 


b200e2 1 -h_Contig4_3 3 .nt-a 


63645 (309G) 


C/A 


b200e2 1 -h_Contig4_33.nt-b 


63646 (309G) 


A/C 


b200e2 1 -h_Contig4_6 1 .nt 


24809 (309G) 


T/G 


b200e2 1 -h_Contig4_62 .nt 


27837 (309G) 


T/C 
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Exon Name 


Location 


Base Change 


b200e2 1 -h_Contig4_63 .nt-a 


31485 (309G) 


C/T 


b200e2 1 -h_Contig4_63.nt-b 


31683 (309G) 


A/G 


b200e2 1 -h_Contig4_9.nt 


24808 (309G) 


T/G 


b527dl2-h_Contig030g_l .nt-a 


31340 (308G) 


T/C 


b527d 1 2-h_Contig030g_l .nt-b 


32538 (308G) 


A/G 


b527dl2-h_Contig080C_2.nt 


13224 (308G) 


A/G 


b527dl2-h_Contig087C_l .nt 


21119 (308G) 


C/A 


b527dl 2-h_Contig087C_4.nt 


30497 (308G) 


G/A 


b527dl2-h_Contig088C_4.nt j 


24811 (309G) 


A/C 


b527dl2-h_Contig089_lHP.nt 


68280 (309G) 


G/A 



In addition to the polymorphisms presented in Table 4, two additional 
polymorphisms can also be present in SEQ ED NO:2. These is a change at position 
2002 of SEQ ID NO:2. Either a guanine or an adenine can appear at this position. 
This polymorphism is silent and is not associated with any change in the amino acid 

15 sequence. The second change is at position 4059 of SEQ ID NO:2 corresponding in 
a cytosine (C) to thymine (T) change. This polymorphism results in a corresponding 
amino acid change from a valine (V) to an alanine (A). Other polymorphisms were 
found in the candidate gene exons and adjacent intron sequences. Any one or 
combination of the polymorphisms listed in Table 4 or the two discussed above 

20 could also have a minor effect on bone mass when present in SEQ ID NO:2. 

The present invention encompasses the nucleic acid sequences having the 
nucleic acid sequence of SEQ ID NO: 1 with the above-identified point mutations. 

Preferably, the present invention encompasses the nucleic acid of SEQ ID 
NO: 2. Specifically, a base-pair substitution changing G to T at position 582 in the 

25 coding sequence of Zmaxl (the HBM gene) was identified as heterozygous in all 
HBM individuals, and not found in the unaffected individuals (i.e., b527dl2- 
h__Contig087C_l.nt). Fig. 5 shows the order of the contigs inB527D12. The 
direction of transcription for the HBM gene is from left to right. The sequence of 
contig308G of B527D12 is the reverse complement of the coding region to the 
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HBM gene. Therefore, the relative polymorphism in contig 308G shown in Table 4 
as a base change substitution of C to A is the complement to the G to T substitution 
in the HBM gene. This mutation causes a substitution of glycine 171 with valine 
(G171V). 

The HBM polymorphism was confirmed by examining the DNA sequence of 
different groups of individuals. In all members of the HBM pedigree (38 
individuals), the HBM polymorphism was observed in the heterozygous form in 
affected (i.e., elevated bone mass) individuals only (N=18). In unaffected relatives 
(N=20) (BMDZ<2.0) the HBM polymorphism was never observed. To determine 
whether this polymorphism was ever observed in individuals outside of the HBM 
pedigree, 297 phenotyped individuals were characterized at the site of the HBM 
gene. None were heterozygous at the site of the HBM polymorphism. In an 
unphenotyped control group, none of 64 individuals were observed to be 
heterozygous at position 582. Taken together, these data prove that the 
polymorphism observed in the kindred displaying the high bone mass phenotype is 
strongly correlated with the G-T polymorphism at position 582 of Zmaxl . 
Furthermore, these results coupled with the ASO results described below, establish 
that the HBM polymorphism genetically segregates with the HBM phenotype, and 
that both the HBM polymorphism and phenotype are rare in the general population. 
XII. Allele Specific Oligonucleotide (ASO) Analysis 

The amplicon containing the HBM1 polymorphism was PCR amplified 
using primers specific for the exon of interest. The appropriate population of 
individuals was PCR amplified in 96 well microtiter plates as follows. PCR 
reactions (20 containing IX Promega PCR buffer (Cat. # Ml 883 containing 1.5 
mM MgCl 2 ), lOOmM dNTP, 200 nM PCR primers (1863F: 

CCAAGTTCTGAGAAGTCC and 1864R: AATACCTGAAACCATACCTG), 1 U 
Amplitaq, and 20 ng of genomic DNA were prepared and amplified under the 
following PCR conditions: 94°C, 1 minute, (94°C, 30 sec; 58°C, 30 sec; 72°C, 1 
min.) X35 cycles), 72°C, 5', 4°C, hold. Loading dye was then added and 10 ^1 of 
the products was electrophoresed on 1.5% agarose gels containing 1 fig/ml ethidium 
bromide at 100-150 V for 5-10 minutes. Gels were treated 20 minutes in denaturing 
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solution (1 .5 M NaCl, 0.5 N NaOH), and rinsed briefly with water. Gels were then 
neutralized in 1 M Tris-HCl, pH 7.5, 1.5 M NaCl, for 20 minutes and rinsed with 
water. Gels were soaked in 10 X SSC for 20 minutes and blotted onto nylon transfer 
membrane (Hybond N+- Amersham) in 10X SSC overnight. Filters were the rinsed 
5 in 6X SSC for 10 minutes and UV crosslinked. 

The allele specific oligonucleotides (ASO) were designed with the 
polymorphism approximately in the middle. Oligonucleotides were phosphate free 
at the 5 9 end and were purchased from Gibco BRL. Sequences of the 
oligonucleotides are: 
1 0 2326 Zmaxl . ASO.g: AGACTGGGGTGAGACGC 

2327Zmaxl.ASO.t: CAGACTGGGTTGAGACGCC 
The polymorphic nucleotides are underlined. To label the oligos, 1.5 of 1 p,g/fil 
ASO oligo (2326.Zmaxl. ASO.g or 2327.Zmaxl.ASO.t), 11 \xl ddH 2 0, 2 jil 10X 
kinase forward buffer, 5 ^1 y 32 P-ATP (6000 Ci/mMole), and 1 nl T4 polynucleotide 
15 kinase (10 U/jil) were mixed, and the reaction incubated at 37°C for 30-60 minutes. 
Reactions were then placed at 95 °C for 2 minutes and 30 ml H 2 0 was added. The 
probes were purified using a G25 microspin column (Pharmacia). 

Blots were prehybridized in 10 ml 5X SSPE, 5X Denhardt's, 2% SDS, and 
1 00 ^g/ml, denatured, sonicated salmon sperm DNA at 40 °C for 2 hr. The entire 
20 reaction mix of kinased oligo was then added to 10 ml fresh hybridization buffer 
(5X SSPE, 5X Denhardt's, 2% SDS) and hybridized at 40°C for at least 4 hours to 
overnight. 

All washes done in 5X SSPE, 0.1 % SDS. The first wash was at 45 °C for 15 
minutes; the solution was then changed and the filters washed 50 °C for 15 minutes. 
25 Filters were then exposed to Kodak biomax film with 2 intensifying screens at 

-70 °C for 15 minutes to 1 hr. If necessary the filters were washed at 55 °C for 15 
minutes and exposed to film again. Filters were stripped by washing in boiling 0.1X 
SSC, 0.1% SDS for 10 minutes at least 3 times. 

The two films that best captured the allele specific assay with the 2 ASOs 
30 were converted into digital images by scanning them into Adobe PhotoShop. These 





WO 01/77327 



PCT/US00/16951 



-65- 



10 



15 



20 



images were overlaid against each other in Graphic Converter and then scored and 
stored in FileMaker Pro 4.0 (see Fig. 9). 

In order to determine the HBM1 allele frequency in ethnically diverse 
populations, 672 random individuals from various ethnic groups were typed by the 
allele specific oligonucleotide (ASO) method. This population included 96 CEPH 
grandparents (primarily Caucasian), 192 Caucasian, 192 African- American, 96 
Hispanic, and 96 Asian individuals. No evidence was obtained for the presence of 
the HBM1 polymorphism in any of these individuals. Overall, a total of 91 1 
individuals were typed either by direct sequencing or ASO hybridization; all were 
homozygous GG at the site of the HBM polymorphism (Fig. 14). This information 
illustrates that the HBM1 allele is rare in various ethnic populations. 

Thus this invention provides a rapid method of identifying individuals with 
the HBM1 allele. This method could be used in the area of diagnostics and 
screening of an individual for susceptibility to osteoporosis or other bone disorder. 
The assay could also be used to identify additional individuals with the HBM1 allele 
or the additional polymorphisms described herein. 

XIII. Cellular Localization of Zmaxl 

A. Gene Expression in Rat tibia by non isotopic In Situ Hybridization 

In situ hybridization was conducted by Pathology Associates International 
(PAI), Frederick, MD. This study was undertaken to determine the specific cell 
types that express the Zmaxl gene in rat bone with particular emphasis on areas of 
bone growth and remodeling. Zmaxl probes used in this study were generated from 
both human (HuZmaxl) and mouse (MsZmaxl) cDNAs, which share an 87% 
sequence identity. The homology of human and mouse Zmaxl with rat Zmaxl is 
unknown. 

For example, gene expression by non-isotopic in situ hybridization was 
performed as follows, but other methods would be known to the skilled artisan. 
Tibias were collected from two 6 to 8 week old female Sprague Dawley rats 
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euthanized by carbon dioxide asphyxiation. Distal ends were removed and proximal 
tibias were snap frozen in OCT embedding medium with liquid nitrogen 
immediately following death. Tissues were stored in a -80 °C freezer. 

Probes for amplifying PCR products from cDNA were prepared as follows. 
The primers to amplify PCR products from a cDNA clone were chosen using 
published sequences of both human LRP5 (Genbank Accession No. AB017498) and 
mouse LRP5 (Genbank Accession No. AF064984). In order to minimize cross 
reactivity with other genes in the LDL receptor family, the PCR products were 
derived from an intracellular portion of the protein coding region. PCR was 
performed in a 50 jul reaction volume using cDNA clone as template. PCR reactions 
contained 1.5 mM MgCl 2 , 1 unit Amplitaq, 200 dNTPs and 2 |nM each primer. 
PCR cycling conditions were 94°C for 1 min., followed by 35 cycles of 94°C for 30 
seconds, 55 °C for 30 seconds, 72 °C for 30 seconds; followed by a 5 minute 
extension at 72° C. The reactions were then run on a 1.5% agarose Tris-Acetate gel. 
DNA was eluted from the agarose, ethanol precipitated and resuspended in 10 mM 
Tris, pH 8.0. Gel purified PCR products were prepared for both mouse and human 
cDNAs and supplied to Pathology Associates International for in situ hybridizations. 

The sequence of the human and mouse PCR primers and products were as 
follows: 

Human Zmax 1 sense primer (HBM1253^ 
CCCGTGTGCTCCGCCGCCCAGTTC 
Human Zmax 1 antisense primer fHBM1465) 
GGCTCACGGAGCTCATCATGGACTT 
Human Zm axl PCR product 

CCCGTGTGCTCCGCCGCCCAGTTCCCCTGCGCGCGGGGTCAGTGTGTGGA 
CCTGCGCCTGCGCTGCGACGGCGAGGCAGACTGTCAGGACCGCTCAGAC 
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GAGGTGGACTGTGACGCCATCTGCCTGCCCAACCAGTTCCGGTGTGCGA 
GCGGCCAGTGTGTCCTCATCAAACAGCAGTGCGACTCCTTCCCCGACTGT 
ATCGACGGCTCCGACGAGCTCATGTGTGAAATCACCAAGCCGCCCTCAG 
ACGACAGCCCGGCCCACAGCAGTGCCATCGGGCCCGTCATTGGCATCAT 
5 CCTCTCTCTCTTCGTCATGGGTGGTGTCTATTTTGTGTGCCAGCGCGTGGT 
GTGCCAGCGCTATGCGGGGGCCAACGGGCCCTTCCCGCACGAGTATGTC 
AGCGGGACCCCGCACGTGCCCCTCAATTTCATAGCCCCGGGCGGTTCCC 
AGCATGGCCCCTTCACAGGCATCGCATGCGGAAAGTCCATGATGAGCTC 
CGTGAGCC 

10 Mouse Zmax 1 Sense primer rHBM1655^ 
AGCGAGGCCACCATCCACAGG 
Mouse Zmax 1 antisense primer (HBM1656) 
TCGCTGGTCGGCATAATCAAT 
Mouse Zmaxl PCR product 

1 5 AGC AGAGCCACCATCCACAGGATCTCCCTGGAGACTAACAACAACGATG 
TGGCTATCCCACTCACGGGTGTCAAAGAGGCCTCTGCACTGGACTTTGAT 
GTGTCCAACAATCACATCTACTGGACTGATGTTAGCCTCAAGACGATCA 
GCCGAGCCTTCATGAATGGGAGCTCAGTGGAGCACGTGATTGAGTTTGG 
CCTCGACTACCCTGAAGGAATGGCTGTGGACTGGATGGGCAAGAACCTC 

20 TATTGGGCGGACACAGGGACCAACAGGATTGAGGTGGCCCGGCTGGATG 
GGCAGTTCCGGCAGGTGCTTGTGTGGAGAGACCTTGACAACCCCAGGTC 
TCTGGCTCTGGATCCTACTAAAGGCTACATCTACTGGACTGAGTGGGGTG 
GCAAGCCAAGGATTGTGCGGGCCTTCATGGATGGGACCAATTGTATGAC 
ACTGGTAGACAAGGTGGGCCGGGCCAACGACCTCACCATTGATTATGCC 

25 GACCAGCGA 
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Riboprobes were synthesized as follows. The PCR products were 
reamplified with chimeric primers designed to incorporate either a T3 promoter 
upstream, or a T7 promoter downstream of the reamplification products. The 
resulting PCR products were used as template to synthesize digoxigenin-labeled 
5 riboprobes by in vitro transcription (IVT). Antisense and sense riboprobes were 
synthesized using T7 and T3 RNA polymerases, respectively, in the presence of 
digoxigenin-1 1-UTP (Boehringer-Mannheim) using a MAXIscript IVT kit 
(Ambion) according to the manufacturer. The DNA was then degraded with Dnase- 
1, and unincorporated digoxigenin was removed by ultrafiltration. Riboprobe 

10 integrity was assessed by electrophoresis through a denaturing polyacrylamide gel. 
Molecular size was compared with the electrophoretic mobility of a 100-1000 base 
pair (bp) RNA ladder (Ambion). Probe yield and labeling was evaluated by blot 
immunochemistry. Riboprobes were stored in 5 |il aliquots at -80°C. 

The in situ hybridization was performed as follows. Frozen rat bone was cut 

15 into 5 |uM sections on a Jung CM3000 cryostat (Leica) and mounted on adhesive 

slides (Instrumedics). Sections were kept in the cryostat at -20 °C until all the slides 
were prepared in order to prevent mRNA degradation prior to post-fixation for 15 
minutes in 4% paraformaldehyde. Following post-fixation, sections were incubated 
with 1 ng/jxl of either antisense or sense riboprobe in Pathology Associates 

20 International (PAI) customized hybridization buffer for approximately 40 hours at 
58°C. Following hybridization, slides were subjected to a series of post- 
hybridization stringency washes to reduce nonspecific probe binding. Hybridization 
was visualized by immunohistochemistry with an anti-digoxigenin antibody (FAB 
fragment) conjugated to alkaline phosphatase. Nitroblue tetrazolium 

25 chloride/bromochloroindolyl phosphate (Boehringer-Mannheim), a precipitating 

alkaline phosphatase substrate, was used as the chromogen to stain hybridizing cells 
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purple to nearly black, depending on the degree of staining. Tissue sections were 
counter-stained with nuclear fast red. Assay controls included omission of the 
probe, omission of probe and anti-digoxigenin antibody. 

Specific cell types were assessed for demonstration of hybridization with 
5 antisense probes by visualizing a purple to black cytoplasmic and/or peri-nuclear 
staining indicating a positive hybridization signal for mRNA. Each cell type was 
compared to the replicate sections, which were hybridized with the respective sense 
probe. Results were considered positive if staining was observed with the antisense 
probe and no staining or weak background with the sense probe. 

1 0 The cellular localization of the hybridization signal for each of the study ... 

probes is summarized in Table 5. Hybridization for Zmaxl was primarily detected 
in areas of bone involved in remodeling, including the endosteum and trabecular 
bone within the metaphysis. Hybridization in selected bone lining cells of the 
periosteum and epiphysis were also observed. Positive signal was also noted in 

15 chondrocytes within the growth plate, particularly in the proliferating chondrocytes. 
See Figs. 10, 1 1 and 12 for representative photomicrographs of in situ hybridization 
results. 

TABLE 5 

Summary of Zmaxl in situ hybridization in rat tibia 



PROBE 


SITE 


ISH SIGNAL 


Hu Zmaxl 


Epiphysis 






Osteoblasts 






Osteoclasts 






Growth Plate 






resting chondrocytes 






proliferating chondrocytes 


+ 




hypertrophic chondrocytes 






Metaphvsis 






osteoblasts 






osteoclasts 


+ 
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hypertrophic chondrocytes 






IVTetaiYhv^i c 






osteoblasts 


+ 




osteoclasts 






Diaphysis 






Endosteum 






osteoblasts 


+ 




osteoclasts 


+ 




Periosteum 


+ 









Legend: = hybridization signal detected = no hybridization signal detected 
"ISH" - In situ hybridization 

These studies confirm the positional expression of Zmaxl in cells involved 
5 in bone remodeling and bone formation. Zmaxl expression in the zone of 

proliferation and in the osteoblasts and osteoclasts of the proximal metaphysis, 
suggests that the Zmaxl gene is involved in the process of bone growth and 
mineralization. The activity and differentiation of osteoblasts and osteoclasts are 
closely coordinated during development as bone is formed and during growth as 
10 well as in adult life as bone undergoes continuous remodeling. The formation of 
internal bone structures and bone remodeling result from the coupling of bone 
resorption by activated osteoclasts with subsequent deposition of new material by 
osteoblasts. Zmaxl is related to the LDL receptor gene, and thus may be a receptor 
involved in mechanosensation and subsequent signaling in the process of bone 
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remodeling. Therefore, changes in the level of expression of this gene could impact 
on the rate of remodeling and degree of mineralization of bone. 
XIV. Antisense 

Antisense oligonucleotides are short synthetic nucleic acids that contain 
5 complementary base sequences to a targeted RNA. Hybridization of the RNA in 
living cells with the antisense oligonucleotide interferes with RNA function and 
ultimately blocks protein expression. Therefore, any gene for which the partial 
sequence is known can be targeted by an antisense oligonucleotide. 

Antisense technology is becoming a widely used research tool and will play 

10 an increasingly important role in the validation and elucidation of therapeutic targets 
identified by genomic sequencing efforts. 

Antisense technology was developed to inhibit gene expression by utilizing 
an oligonucleotide complementary to the mRNA that encodes the target gene. There 
are several possible mechanisms for the inhibitory effects of antisense 

15 oligonucleotides. Among them, degradation of mRNA by RNase H is considered to 
be the major mechanism of inhibition of protein function. This technique was 
originally used to elucidate the function of a target gene, but may also have 
therapeutic applications, provided it is designed carefully and properly. 
An example of materials and methods for preparing antisense 

20 oligonucleotides can be performed as follows. Preliminary studies have been 
undertaken in collaboration with Sequiter (Natick, MA) using the antisense 
technology in the osteoblast-like murine cell line, MC3T3. These cells can be 
triggered to develop along the bone differentiation sequence. An initial proliferation 
period is characterized by minimal expression of differentiation markers and initial 

25 synthesis of collagenous extracellular matrix. Collagen matrix synthesis is required 
for subsequent induction of differentiation markers. Once the matrix synthesis 
begins, osteoblast marker genes are activated in a clear temporal sequence: alkaline 
phosphatase is induced at early times while bone sialoprotien and osteocalcin appear 
later in the differentiation process. This temporal sequence of gene expression is 

30 useful in monitoring the maturation and mineralization process. Matrix 

mineralization, which does not begin until several days after maturation has started, 
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involves deposition of mineral on and within collagen fibrils deep within the matrix 
near the cell layer-culture plate interface. The collagen fibril-associated mineral 
formed by cultured osteoblasts resembles that found in woven bone in vivo and 
therefore is used frequently as a study reagent. 
5 MC3T3 cells were transfected with antisense oligonucleotides for the first 

week of the differentiation, according to the manufacturer's specifications (U.S. 
Patent No. 5,849,902). 

The oligonucleotides designed for Zmaxl are given below: 
10875: AGUACAGCUUCUUGCCAACCCAGUC 
10 10876: UCCUCCAGGUCGAUGGUCAGCCCAU 
10877: GUCUGAGUCCGAGUUCAAAUCCAGG 

Fig. 13 shows the results of antisense inhibition of Zmaxl in MC3T3 cells. The 
three oligonucleotides shown above were transfected into MC3T3 and RNA was 
isolated according to standard procedures. Northern analysis clearly shows 
15 markedly lower steady state levels of the Zmaxl transcript while the control gene 
GAPDH remained unchanged. Thus, antisense technology using the primers 
described above allows for the study of the role of Zmaxl expression on bone 
biology. 

XV. Yeast Two Hybrid 

20 In order to identify the signaling pathway that Zmaxl participates in to 

modulate bone density, the yeast two hybrid protein interaction technology was 
utilized. This technique facilitates the identification of proteins that interact with 
one another by coupling tester proteins to components of a yeast transcription 
system (Fields and Song, 1989, Nature 340: 245-246; U.S. Pat. No. 5,283,173 by 

15 Fields and Song; Johnston, 1987 Microbiol Rev. 51: 458-476; Keegan et al, 1986, 
Science 231: 699-704; Durfee et al, 1993, Genes Dev. 7: 555-569; Chien et al, 1991, 
Proc. Natl Acad. Sci USA 88: 9578-9582; Fields et al., 1994, Trends in Genetics 10: 
286-292; and Gyuris et al., 1993, Cell 75: 791-803). First a "bait' ' protein, the 
protein for which one seeks interacting proteins, is fused to the DNA binding 

>0 domain of a yeast transcription factor. Second, a cDNA library is constructed that 
contains cDNAs fused to the transcriptional activation domain of the same yeast 
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transcription factor; this is termed the prey library. The bait construct and prey 
library are transformed into yeast cells and then mated to produce diploid cells. If 
the bait interacts with a specific prey from the cDNA library, the activation domain 
is brought into the vicinity of the promoter via this interaction. Transcription is then 
5 driven through selectable marker genes and growth on selective media indicates the 
presence of interacting proteins. 

The amino acid sequence used in the yeast two hybrid experiments discussed 
herein consisted of the entire cytoplasmic domain and a portion of the 
transmembrane domain and is shown below (amino to carboxy orientation): 

10 RWCQRYAGA NGPFPHEYVS GTPHVPLNFI APGGSQHGPF TGIACGKSMM 
SSVSLMGGRG GVPLYDRNHV TGASSSSSSS TKATLYPPI L NPPPSPA TDP 
SLYNMDMFYS SNIPATVRPY RPYIIRG MAP PTTP CSTDVC DSDYSASRWK 
AS KY YLDLNS DSDP YPPPPT PH SOYLSAE D SCPPSPA TER SYFHLFPPPP 
SPC TDSS 

15 The last 6 amino acids of the putative transmembrane domain are indicated 

in bold. Putative SH3 domains are underlined. Additional amino acid sequences of 
50 amino acids or greater in either the proteins encoded by the Zmaxl or HBM 
alleles can also be used as bait. The upper size of the polypeptide used as bait is 
limited only by the presence of a complete transmembrane domain (see Fig. 4), 

20 which will render the bait to be nonfunctional in a yeast two hybrid system. These 
additional bait proteins can be used to identify additional proteins which interact 
with the proteins encoded by HBM or Zmaxl in the focal adhesion signaling 
pathway or in other pathways in which these HBM or Zmaxl proteins may act. 
Once identified, methods of identifying agents which regulate the proteins in the 

25 focal adhesion signaling pathway or other pathways in which HBM acts can be 
performed as described herein for the HBM and Zmaxl proteins. 

In order to identify cytoplasmic Zmaxl signaling pathways, the cytoplasmic 
domain of Zmaxl was subcloned into two bait vectors. The first vector was 
pDBleu, which was used to screen a brain, and Hela prey cDNA library cloned into 

30 the vector pPC86 (Clontech). The second bait vector used was pDBtrp, which was 
used to screen a cDNA prey library derived from the TE85 osteosarcoma cell line in 
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vector pOP46. Standard techniques known to those skilled in the art were used as 
described in Fields and Song, 1989, Nature 340: 245-246; U.S. Pat. No. 5,283,173 
by Fields and Song; Johnston, 1987 ', Microbiol. Rev. 51: 458-476; Keegan et al., 
1986, Science 231: 699-704; Durfee et al., 1993, Genes Dev. 7: 555-569; Chien et 
al., 1991, Proc. Natl. Acad. Sci USA 88: 9578-9582; Fields et al., 1994, Trends in 
Genetics 10: 286-292; and Gyuris et al., 1993, Cell 75: 791-803. The bait construct 
and prey cDNA libraries were transformed into yeast using standard procedures. 

To perform the protein interaction screen, an overnight culture of the bait 
yeast strain was grown in 20 ml SD selective medium with 2% glucose (pDBLeu, 
SD -Leu medium, pDBtrp, SD -tip medium). The cultures were shaken vigorously 
at 30°C overnight. The cultures were diluted 1 : 10 with complete medium (YEPD 
with 2% glucose) and the cultures then incubated with shaking for 2 hrs at 30°C. 

The frozen prey library was thawed, and the yeast cells reactivated by 
growing them in 150 ml YEPD medium with 2% glucose for 2 hrs at 30°C. A filter 
unit was sterilized with 70% ethanol and washed with sterile water to remove the 
ethanol. The cell densities of both bait and prey cultures were measured by 
determining the OD at 600 nm. An appropriate volume of yeast cells that 
corresponded to a cell number of 1 ml of OD 600 = 4 of each yeast strain, bait and 
prey (library) was placed in a 50 ml Falcon tube. The mixture was then filtered 
through the sterilized filter unit. The filter was then transferred onto a prewarmed 
YEPD agar plate with the cell side up, removing all air bubbles underneath the filter. 
Plates were then incubated at 30°C for 6 hrs. One filter was transferred into a 50 ml 
Falcon tube, and 10 ml of SD with 2% Glucose was added; cells were resuspended 
by vortexing for 10 sec. 

The number of primary diploid cells (growth on SD -Leu, -Trp plates) versus 
the numbers of colony forming units growing on SD -Trp and SD -Leu plates only 
was then titered. Different dilutions were plated and incubated at 30°C for two days. 
The number of colony forming units was then counted. The number of diploid 
colonies (colonies on SD -Leu -Trp plates) permits the calculation of whether or not 
the whole library of prey constructs was mated to the yeast expressing the bait. This 
information is important to judge the quality of the screen. 
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Indirect selection 
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Resuspended cells from 5 filtermatings were then pooled and the cells 
sedimented by centrifugation in a 50 ml Falcon tube. Cells were then resuspended 
in 16 ml SD medium with 2% Glc. Two ml of this cell suspension was plated onto 8 
square plates each (SD -Leu, -Trp) with sterile glass beads and selected for diploid 
cells by incubating at 30 °C for 18 - 20 hrs. 

Cells were then scraped off the square plates, the cells centrifuged and 
combined into one 50 ml Falcon tube. The cell pellet was then resuspended in 25 ml 
of SD medium with 2% glucose. The cell number was then determined by counting 
of an appropriate dilution (usually 1 : 1 00 to 1:1 000) with a Neugebauer chamber. 
Approximately 5 x 10 7 diploid cells were plated onto the selective medium. The 
observations about the growth of the bait strain together with irrelevant prey vectors 
helps to determine which selective plates will have to be chosen for the library 
screen. Generally, all screens were plated on one square plate each with SD -Leu, - 
Trp, -His; SD -Leu, -Trp, His, 5 mM 3AT, and SD -Leu, -Trp, -His, -Ade is 
recommended. 

The yeast cells were then spread homogeneously with sterile glass beads and 
incubated at 30°C for 4 days. The number of colony forming yeast cells was titered 
by plating different dilutions of the scraped cell suspension onto SD -Leu, -Trp 
plates. Usually, plating of 100 \i\ of a 10~ 3 and 10" 4 dilution gave 100 - 1000 colonies 
per plate. 

B. Direct selection 

Five filters with the mated yeast cells were each transferred into separate 50 
ml Falcon tubes and the cells resuspended with 10 ml SD medium with 2% Glc, 
each, followed by vortexing for 10 sec. The resuspended cells were combined and 
centrifuged in a Beckman centrifuge at 3000 rpm. The supernatant was discarded 
and the cells resuspended in 6 ml of SD medium with 2% Glc. Two ml of the 
suspension was spread onto each selective square plate and incubated at 30° C for 4 - 
5 days. 
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C. Isolation of Single Colonies 

Yeast cells from an isolated colony were picked with a sterile tooth pick and 
transferred into individual wells of a 96 well plate. The cells were resuspended in 50 
pi of SD -Leu, -Trp, -His medium and incubated at 30 °C for one day. The yeast 
5 cells were then stamped onto a SD -Leu, -Trp, -His plate in 96 well format and 
incubated at 30 °C for 2 days. Yeast cells were also stamped onto a Nylon filter 
covering a YEPD plate and incubated at 30°C for one day. The cells on the Nylon 
filter were used for the analysis of the J3 - Gal reporter activity. 

Yeast colonies were scraped from the SD -Leu, -Trp, -His plate with a sterile 
1 0 tooth pick, and reconfigured, if necessary, according to the 13 - Gal activity and then 
resuspended in 20 % glycerol. This served as a master plate for storage at -80°C. 

For DNA preparation, yeast cells from the glycerol stock were stamped onto 
a SD -Trp plate and incubated at 30°C for 2 days. After two days of incubation, the 
yeast colonies were ready for colony PCR and sequencing. Standard colony PCR 
1 5 conditions were used to amplify inserts from preys recovered from the interaction 
screen. Sequencing was done using standard sequencing reactions and ABI377 
(Perkin Elmer) fluorescent sequencing machines. 
D. Verification of bait/prey interaction 

Glycerol stocks of the prey of interest were thawed and inoculated in a 10 ml 
20 overnight culture of SD with glucose -Trp. After overnight growth, plasmid DNA 
preparation was performed using the BIO 101 RPM Yeast Plasmid Isolation Kit 
with 10 ml of culture. The culture was centrifuged and transfered to a 1.5 ml 
microcentrifuge tube. Yeast Lysis Matrix was then added to the pellet followed by 
250 pi of Alkaline Lysis Solution. Samples were then vortexed for 5 minutes. 250 
25 pi Neufralizing Solution was added and the sample mixed briefly. Samples were 

centrifuged for 2 minutes at room temperature in a microcentrifuge. The supernatant 
was transferred to a Spin Filter avoiding debris and Lysis Matrix. 250 pi of 
Glassmilk Spin Buffer was added, and the tubes inverted to mix. Samples were 
centrifuged for 1 min and the liquid in the Catch Tube was discarded. 500 pi of 
30 Wash Solution was added, the samples were centrifuged for 1 min, and the wash 
solution was discarded. The wash step was repeated once followed by a 1 min dry 
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centrifugation to drive the remaining liquid out of the Spin Filter. The filter was 
transferred to a new Catch Tube and 100 \il of sterile H 2 0 was added; samples were 
then vortexed briefly to resuspend and centrifuged for 30 seconds to collect the 
DNA in the bottom of the Catch Tube. 
5 Five of DNA was then transformed into DH10B Electromax cells using 

standard procedures and glycerol stocks prepared. Miniprep DNA was prepared 
using the Qiagen QIAprep Spin Miniprep Kit. DNA was finally eluted with 30 jlxI of 
Qiagen EB buffer. One jil of the plasrnid DNA samples was then used to transform 
yeast cells using standard procedures. After 2 days of growth on SD — trp media, 

10 colonies were picked and patched onto fresh media. Similarly, bait colonies were 
patched onto SD -Leu media. Both were grown overnight at 30 °C. 

For mating, cells from bait and prey patches were spread together on YAPD 
media and incubated at 30°C for 12 hr. This plate was then replicaplated onto an SD 
Agar-Leu-Trp plate and grown for 2 days at 30°C. To test the strength of 

15 interaction these plates were replicaplated onto SD Agar-Leu-Trp-His, SD Agar- 
Leu-Trp-His with 5 mM 3 AT and 10 mM 3 AT, SD Agar-Leu-Trp-His- Ade, and 
SD Agar-Leu-Trp-Ura media and grown for 2 days at 30 °C. 
E. Galacton Star (5-Galactosidase Activity Assay 
After streaking and replica plating positive interactors on selection plates, 

20 colonies were placed in a 96 well dish with 200 jal of SD-medium, leaving wells 1 

and 96 blank. Ten microliters from the first 96 well dish was plated into another flat 
bottom 96 well dish containing 100 jil of SD-medium. Controls consisted of a 
negative control and a very weak positive control. The cell density was measured at 
OD 600 (a value of 1 corresponds to lxlO 7 cells utilizing a 96 well 

25 spectrophotometer). The OD was usually between 0.03 and 0.10. Using 

microplates specifically for the luminometer, 50 \i\ of reaction mixture were pipetted 
into each well. Fifty microliters of culture were then added and mixed by pipetting 
up and down twice. The reaction was incubated for 30 minutes at room temperature 
followed by measurement of Relative Light Units using a luminometer. 

30 Table 6 lists the genes identified in the yeast two hybrid screens from the 3 

prey libraries tested. Two genes, zyxin and axin, were found to interact with the 
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cytoplasmic domain of Zmaxl in all three screens. Three genes, alpha-actinin, TCB 
and Sl-5 interacted in two of the three screens. 

A variety of proteins found at sites of cell-cell and cell-matrix contact (focal 
contacts/adesion plaques) were shown to interact with the cytoplasmic domain of 
Zmaxl. These include alpha-actinin, Trio, Pinch-like protein, and Zyxin. PINCH is 
a LIM domain-containing protein that is known to interact with integrin-linked 
kinase, an early signaler in integrin and growth factor signaling pathways. The 
finding of a closely related gene in the yeast two hybrid screen raises the possibility 
of a novel pathway linked to integrin signaling from extracellular matrix signals. 
Trio, also known to localize to focal adhesions, is thought to play a key role in 
coordinating cell-matrix interactions and cytoskeletal rearrangements involved in 
cell movement. Zyxin, another LIM domain-containing protein, is also localized to 
adhesion plaques and is thought to be involved in reorganization of the cytoskeleton 
when triggers are transmitted via integrin signaling pathways. Zyxin also interacts 
with alpha actinin, which we identified as interacting with Zmaxl . Other LIM 
domain containing proteins identified include the human homologue of mouse 
ajuba, LIMD 1 , and a novel LIMD 1 -like protein. 

Axin was also identified from the two hybrid experiments. This protein is 
involved in inhibition of the Wnt signaling pathway and interacts with the tumor 
suppressor APC. There is a link here with the focal adhesion signaling described 
above: one common step in the two pathways involves inhibition of glycogen 
synthase kinase 3, which in turn results in the activation of B-catenin/Lef-1 and AP-1 
transcription factors. Axin/APC are involved in this as well as integrin linked 
kinase. The Wnt pathway has a role in determining cell fates during embryogenesis. 
If inappropriately activated, the Wnt pathway may also lead to cancer. The Wnt 
pathway also seems to have a role in cytoskeletal rearrangements. A model 
depicting Zmaxl involvement in focal adhesion signaling is depicted in Fig. 15. 

This data coupled with other studies suggest that integrin signaling pathways 
have a role in cellular responses to mechanical stress and adhesion. This provides an 
attractive model for the mechanism of action of Zmaxl in bone biology. It is 
possible that Zmaxl is involved in sensing either mechanical stress directly or 
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binding a molecule in the extracellular matrix that is related to mechanical sensation. 
Signaling through subsequent pathways may be involved in bone remodeling due to 
effects on cell morphology, cell adhesion, migration, proliferation, differentiation, 
and apoptosis in bone cells. 

5 Table 6: Yeast Two Hybrid Results 



Gene 
Symbol 


Gene 


Genbank 
Accession # 


NT 
SEQID 
NO: 


AA 
SEQID 
NO: 


ACINI 


alpha-actinin 


NM_001102 


63 




AES 


ammo-terminal enhancer of 


NM 001130.3 


64 




ATP4 


atrophin-1 interacting protein 


AF038564.1 


65 




Novel 


Ajuba 




66 




AXDSf 


Wnt signaling 


AF009674.1 


67 




CDC23 


cell division cycle 23, yeast, 
homolog 


NM_004661.1 


68 




HSM800944 


Similar to TRIO 


AL117435.1 


69 




HSM800936 




AL1 17427.1 


70 




Novel 


Similar to LIM domains 
containing protein 1 




71 




DEEPEST 


mitotic spindle coiled-coil 
related protein 


NM_006461.l 


72 




ECM1 


extracellular matrix protein 1 


U65932.1 


73 




EF1A 


elongation factor 1 -alpha 


X16869.1 


74 




FN 


fibronectin 


X02761.1 


75 




HOXB13 


homeodomain protein 


U81599.1 


76 




Novel 


Glu-Lys Rich protein 




77 




LIMD1 


LIM domains containing 1 


NM_014240.1 


78 




Novel 


PINCH-like 




79 




RANBPM 


centrosomal protein 


NM_005493.1 


L 80 




Sl-5 


extracellular protein 


U03877.1 


81 




TCB 


gene encoding cytosolic 
thyroid hormone-binding 


M26252.1 


82 




TED 


tumorous imaginal discs 


NM_005147.1 


83 




ZYX 


Zyxin 


NM_003461.1 


84 




TRIO 


GTPase 


U42390.1 


85 




HUMPITPB 


phosphatidylinositol transfer 
protein 


D30037.1 


86 




ACTN1 


alpha-actinin 


NP_001093.1 




87 
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Gene 
Symbol 


Gene 


Genbank 
Accession # 


NT 
SEQID 
NO: 


AA 
SEQID 
NO: 


AES 


amino-terminal enhancer of 


NP_001 121.2 




88 


AIP4 


atrophin-1 interacting protein 


AAC04845.1 




89 


Novel 


Ajuba 






90 


AXIN 


Wnt signalling 


AAC5 1624.1 




91 


CDC23 


cell division cycle 23, yeast 
homolog 


NP_004652.1 




92 


Novel 


Similar to TRIO CAB55923.1 






93 


Novel 


Similar to LIM domains 
containing protein 1 






94 


DEEPEST 


mitotic spindle coiled-coil 
related protein 


NP_006452.1 




95 


ECM1 


extracellular matrix protein 1 


AAB05933.1 




96 


EF1A 


elongation factor 1 -alpha 


CAA34756.1 




97 


FN 


fibronectin 


CAA26536.1 




98 


Novel 


Glu-Lys rich protein 






99 


HOXB13 


homeodomain protein B 1 3 


AAB39863.1 




100 


LIMD1 


LIM domains containing 1 


NP_055055.1 




101 


Novel 


PINCH-like 






102 


RANBPM 


centrosomal protein 


NP_005484.1 




103 


Sl-5 


extracellular protein 


AAA65590.1 




104 


TCB 


cytosolic thyroid hormone- 
binding protein 


AAA36672.1 




105 


TED 


tumorous imaginal discs 


NP_005 138.1 




106 


ZYX 


Zyxin 


NP_003452.1 




107 


TRIO 


GTPase 


AAC34245.1 




108 


PTDINSTP 


phosphatidylinositol transfer 
protein beta isoform 


P48739 




109 



In light of the model depicted in Fig. 15 and the results shown in Table 6, 
another aspect contemplated by the invention would be to regulate bone density and 
25 bone mass disorders by the regulating focal adhesion signaling. The regulation can 
occur by regulating the DNA, mRNA transcript or protein encoded by any of the 
members involved in the focal adhesion signaling pathway as identified by the yeast 
two hybrid system. 

Also contemplated are the novel nucleic acids and proteins identified by the 
30 HBM yeast two hybrid system. These include but are not limited to SEQ ID NO: 66 
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(Ajuba), SEQ ID NO: 71 (a gene similar to a gene encoding LIM domains 
containing protein 1), SEQ ID NO: 77 (Glu-Lys Rich protein), SEQ ID NO: 79 
(PINCH-like gene), SEQ ID NO: 90 (Ajuba protein), SEQ ID NO: 93 (protein 
similar to TRIO), SEQ ID NO: 94 0, SEQ ID NO: 99 (Glu-Lys rich protein) and 
5 SEQ ID NO: 102 (PINCH-like protein). 
XVI. Potential Function 

The protein encoded by Zmaxl is related to the Low Density Lipoprotein 
receptor (LDL receptor). See, Goldstein et al, Ann. Rev. Cell Biology, 1:1-39 
(1985); Brown et al, Science, 232:34-47 (1986). The LDL receptor is responsible 

10 for uptake of low density lipoprotein, a lipid-protein aggregate that includes 
cholesterol. Individuals with a defect in the LDL receptor are deficient in 
cholesterol removal and tend to develop artherosclerosis. In addition, cells with a 
defective LDL receptor show increased production of cholesterol, in part because of 
altered feedback regulation of cholesterol synthetic enzymes and in part because of 

15 increased transcription of the genes for these enzymes. In some cell types, 
cholesterol is a precursor for the formation of steroid hormones. 

Thus, the LDL receptor may, directly or indirectly, function as a signal 
transduction protein and may regulate gene expression. Because Zmaxl is related to 
the LDL receptor, this protein may also be involved in signaling between cells in a 

20 way that affects bone remodeling. 

The glycine 171 amino acid is likely to be important for the function of 
Zmaxl because this amino acid is also found in the mouse homologue of Zmaxl. 
The closely related LRP6 protein also contains glycine at the corresponding position 
(Brown et al, Biochemical and Biophysical Research Comm., 248:879-888 (1988)). 

25 Amino acids that are important in a protein's structure or function tend to be 
conserved between species, because natural selection prevents mutations with 
altered amino acids at important positions from arising. 

In addition, the extracellular domain of Zmaxl contains four repeats 
consisting of five YWTD motifs followed by an EFG motif. This 5YWTD+EGF 

30 repeat is likely to form a distinct folded protein domain, as this repeat is also found 
in the LDL receptor and other LDL receptor-related proteins. The first three 
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5 YWTD+EGF repeats are very similar in their structure, while the fourth is highly 
divergent. Glycine 171 occurs in the central YWTD motif of the first 
5 YWTD+EGF repeat in Zmaxl. The other two similar 5 YWTD+EGF repeats of 
Zmaxl also contain glycine at the corresponding position, as does the 
5 5YWTD+EGF repeat in the LDL receptor protein. However, only 17.6% of the 

amino acids are identical among the first three 5 YWTD+EGF repeats in Zmaxl and 
the single repeat in the LDL receptor. These observations indicate that glycine 171 
is essential to the function of this repeat, and mutation of glycine 171 causes a 
functional alteration of Zmaxl . The cDNA and peptide sequences are shown in 
10 Figs. 6A-6E. The critical base at nucleotide position 582 is indicated in bold and is 
underlined. 

Northern blot analysis (Figs. 7A-B) reveals that Zmaxl is expressed in 
human bone tissue as well as numerous other tissues. A multiple-tissue Northern 
blot (Clontech, Palo Alto, CA) was probed with exons from Zmaxl . As shown in 

15 Fig. 7 A, the 5.5 kb Zmaxl transcript was highly expressed in heart, kidney, lung, 
liver and pancreas and is expressed at lower levels in skeletal muscle and brain. A 
second northern blot, shown in Fig. 7B, confirmed the transcript size at 5.5 kb, and 
indicated that Zmaxl is expressed in bone, bone marrow, calvaria and human 
osteoblastic cell lines. 

20 Taken together, these results coupled with the yeast two hybrid results 

indicate that the HBM polymorphism in the Zmaxl gene is responsible for the HBM 
phenotype, and that the Zmaxl gene is important in bone development. In addition, 
because mutation of Zmaxl can alter bone mineralization and development, it is 
likely that molecules that bind to Zmaxl may usefully alter bone development. 
25 Such molecules may include, for example, small molecules, proteins, RNA 
aptamers, peptide aptamers, and the like. 

XVII. Preparation of Nucleic Acids, Vectors, Transformations and Host Cells 

Large amounts of the nucleic acids of the present invention may be produced 
by replication in a suitable host cell. Natural or synthetic nucleic acid fragments 
30 coding for a desired fragment will be incorporated into recombinant nucleic acid 

constructs, usually DNA constructs, capable of introduction into and replication in a 
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prokaryotic or eukaryotic cell Usually the nucleic acid constructs will be suitable 
for replication in a unicellular host, such as yeast or bacteria, but may also be 
intended for introduction to (with and without integration within the genome) 
cultured mammalian or plant or other eukaryotic cell lines. The purification of 
5 nucleic acids produced by the methods of the present invention is described, for 
example, in Sambrook et al, Molecular Cloning. A Laboratory Manual, 2nd Ed. 
(Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1989) or Ausubel et al, 
Current Protocols in Molecular Biology, J. Wiley and Sons, NY (1992). 

The nucleic acids of the present invention may also be produced by chemical 

10 synthesis, e.g., by the phosphoramidite method described by Beaucage et al, Tetra. 
Letts., 22:1859-1862 (1981) or the triester method according to Matteucci,et al, J. 
Am. Chem. Soc, 103:3185 (1981), and may be performed on commercial, automated 
oligonucleotide synthesizers. A double-stranded fragment may be obtained from the 
single-stranded product of chemical synthesis either by synthesizing the 

15 complementary strand and annealing the strands together under appropriate 

conditions or by adding the complementary strand using DNA polymerase with an 
appropriate primer sequence. 

Nucleic acid constructs prepared for introduction into a prokaryotic or 
eukaryotic host may comprise a replication system recognized by the host, including 

20 the intended nucleic acid fragment encoding the desired protein, and will preferably 
also include transcription and translational initiation regulatory sequences operably 
linked to the protein encoding segment. Expression vectors may include, for 
example, an origin of replication or autonomously replicating sequence (ARS) and 
expression control sequences, a promoter, an enhancer and necessary processing 

25 information sites, such as ribosome-binding sites, RNA splice sites, polyadenylation 
sites, transcriptional terminator sequences, and mRNA stabilizing sequences. 
Secretion signals may also be included where appropriate, whether from a native 
HBM or Zmaxl protein or from other receptors or from secreted proteins of the 
same or related species, which allow the protein to cross and/or lodge in cell 

30 membranes, and thus attain its functional topology, or be secreted from the cell. 

Such vectors may be prepared by means of standard recombinant techniques well 
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known in the art and discussed, for example, in Sambrook et al, Molecular Cloning. 
A Laboratoiy Manual, 2nd Ed. (Cold Spring Harbor Laboratory, Cold Spring 
Harbor, NY (1989) or Ausubel et al, Cun-ent Protocols in Molecular Biology, J. 
Wiley and Sons, NY (1992). 

An appropriate promoter and other necessary vector sequences will be 
selected so as to be functional in the host, and may include, when appropriate, those 
naturally associated with Zmaxl or HBM genes. Examples of workable 
combinations of cell lines and expression vectors are described in Sambrook et al, 
Molecular Cloning. A Laboratory Manual, 2nd Ed. (Cold Spring Harbor Laboratory, 
Cold Spring Harbor, NY (1989) or Ausubel et al, Current Protocols in Molecular 
Biology, J. Wiley and Sons, NY (1992). Many useful vectors are known in the art 
and may be obtained from such vendors as Stratagene, New England BioLabs, 
Promega Biotech, and others. Promoters such as the trp, lac and phage promoters, 
tRNA promoters and glycolytic enzyme promoters may be used in prokaryotic hosts. 
Useful yeast promoters include promoter regions for metallothionein, 3- 
phosphoglycerate kinase or other glycolytic enzymes such as enolase or 
glyceraldehyde-3 -phosphate dehydrogenase, enzymes responsible for maltose and 
galactose utilization, and others. Vectors and promoters suitable for use in yeast 
expression are further described in EP 73, 675 A. Appropriate non-native 
mammalian promoters might include the early and late promoters from S V40 (Fiers 
et al, Nature, 273:113 (1978)) or promoters derived from murine Moloney leukemia 
virus, mouse tumor virus, avian sarcoma viruses, adenovirus II, bovine papilloma 
virus or polyoma. In addition, the construct may be joined to an amplifiable gene 
(e.g., DHFR) so that multiple copies of the gene may be made. For appropriate 
enhancer and other expression control sequences, see also Enhancers and 
Eukaryotic Gene Expression, Cold Spring Harbor Press, Cold Spring Harbor, NY 
(1983). 

While such expression vectors may replicate autonomously, they may also 
replicate by being inserted into the genome of the host cell, by methods well known 
in the art. 



BNSDOCID: <WO. 



.0177327A1J_> 



Y 



WO 01/77327 



PCT/US00/16951 



-85- 

Expression and cloning vectors will likely contain a selectable marker, a 
gene encoding a protein necessary for survival or growth of a host cell transformed 
with the vector. The presence of this gene ensures growth of only those host cells 
which express the inserts. Typical selection genes encode proteins that a) confer 
5 resistance to antibiotics or other toxic substances, e.g. ampicillin, neomycin, 

methotrexate, etc.; b) complement auxotrophic deficiencies, or c) supply critical 
nutrients not available from complex media, e.g., the gene encoding D-alanine 
racemase for Bacilli. The choice of the proper selectable marker will depend on the 
host cell, and appropriate markers for different hosts are well known in the art. 

10 The vectors containing the nucleic acids of interest can be transcribed in 

vitro, and the resulting RNA introduced into the host cell by well-known methods, 
e.g., by injection (see, Kubo et al, FEBS Letts. 241:119 (1988)), or the vectors can 
be introduced directly into host cells by methods well known in the art, which vary 
depending on the type of cellular host, including electroporation; transfection 

15 employing calcium chloride, rubidium chloride, calcium phosphate, DEAE-dextran, 
or other substances; microprojectile bombardment; lipofection; infection (where the 
vector is an infectious agent, such as a retroviral genome); and other methods. See 
generally, Sambrook et al., 1989 and Ausubel et aL, 1992. The introduction of the 
nucleic acids into the host cell by any method known in the art, including those 

20 described above, will be referred to herein as "transformation." The cells into which 
have been introduced nucleic acids described above are meant to also include the 
progeny of such cells. 

Large quantities of the nucleic acids and proteins of the present invention 
may be prepared by expressing the Zmaxl or HBM nucleic acids or portions thereof 

25 in vectors or other expression vehicles in compatible prokaryotic or eukaryotic host 
cells. The most commonly used prokaryotic hosts are strains of Escherichia coli, 
although other prokaryotes, such as Bacillus subtilis or Pseudomonas may also be 
used. 

Mammalian or other eukaryotic host cells, such as those of yeast, 
30 filamentous fungi, plant, insect, or amphibian or avian species, may also be useful 
for production of the proteins of the present invention. Propagation of mammalian 
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cells in culture is per se well known. See, Jakoby and Pastan (eds.), Cell Culture. 
Methods in Enzymology, volume 58, Academic Press, Inc., Harcourt Brace 
Jovanovich, NY, (1979)). Examples of commonly used mammalian host cell lines 
are VERO and HeLa cells, Chinese hamster ovary (CHO) cells, and WI38, BHK, 
and COS cell lines, although it will be appreciated by the skilled practitioner that 
other cell lines may be appropriate, e.g., to provide higher expression desirable 
glycosylation patterns, or other features. 

Clones are selected by using markers depending on the mode of the vector 
construction. The marker may be on the same or a different DNA molecule, 
preferably the same DNA molecule. In prokaryotic hosts, the transformant may be 
selected, e.g., by resistance to ampicillin, tetracycline or other antibiotics. 
Production of a particular product based on temperature sensitivity may also serve as 
an appropriate marker. 

Prokaryotic or eukaryotic cells transformed with the nucleic acids of the 
present invention will be useful not only for the production of the nucleic acids and 
proteins of the present invention, but also, for example, in studying the 
characteristics of Zmaxl or HBM proteins. 

Antisense nucleic acid sequences arc useful in preventing or diminishing the 
expression of Zmaxl or HBM, as will be appreciated by one skilled in the art. For 
example, nucleic acid vectors containing all or a portion of the Zmaxl or HBM gene 
or other sequences from the Zmaxl or HBM region may be placed under the control 
of a promoter in an antisense orientation and introduced into a cell. Expression of 
such an antisense construct within a cell will interfere with Zmaxl or HBM 
transcription and/or translation and/or replication. 

The probes and primers based on the Zmaxl and HBM gene sequences 
disclosed herein are used to identify homologous Zmaxl and HBM gene sequences 
and proteins in other species. These Zmaxl and HBM gene sequences and proteins 
are used in the diagnostic/prognostic, therapeutic and drug screening methods 
described herein for the species from which they have been isolated. 
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XVIII. Protein Expression and Purification 

Expression and purification of the HBM protein of the invention can be 
performed essentially as outlined below. To facilitate the cloning, expression and 
purification of membrane and secreted protein from the HBM gene, a gene 
expression system, such as the pET System (Novagen), for cloning and expression 
of recombinant proteins in E. coli was selected. Also, a DNA sequence encoding a 
peptide tag, the His-Tap, was fused to the 3 f end of DNA sequences of interest to 
facilitate purification of the recombinant protein products. The 3' end was selected 
for fusion to avoid alteration of any 5' terminal signal sequence. 

Nucleic acids chosen, for example, from the nucleic acids set forth in SEQ 
ID NOS: 1, 3 and 5-12 for cloning HBM were prepared by polymerase chain 
reaction (PCR). Synthetic oligonucleotide primers specific for the 5 f and 3 1 ends of 
the HBM nucleotide sequence were designed and purchased from Life Technologies 
(Gaithersburg, MD). All forward primers (specific for the 5' end of the sequence) 
were designed to include an Ncol cloning site at the 5' terminus. These primers 
were designed to permit initiation of protein translation at the methionine residue 
encoded within the Ncol site followed by a valine residue and the protein encoded 
by the HBM DNA sequence. All reverse primers (specific for the 3' end of the 
sequence) included an EcoRI site at the 5 f terminus to permit cloning of the HBM 
sequence into the reading frame of the pET-28b. The pET-28b vector provided a 
sequence encoding an additional 20 carboxyl-terminal amino acids including six 
histidine residues (at the C-terminus), which comprised the histidine affinity tag. 

Genomic DNA prepared from the HBM gene was used as the source of 
template DNA for PCR amplification (Ausubel et al, Current Protocols in 
Molecular Biology, John Wiley & Sons (1994)). To amplify a DNA sequence 
containing the HBM nucleotide sequence, genomic DNA (50 ng) was introduced 
into a reaction vial containing 2 mM MgCl 2 , 1 \xM synthetic oligonucleotide primers 
(forward and reverse primers) complementary to and flanking a defined HBM, 0.2 
mM of each of deoxynucleotide triphosphate, dATP, dGTP, dCTP, dTTP and 2.5 
units of heat stable DNA polymerase (Amplitaq, Roche Molecular Systems, Inc., 
Branchburg, NJ) in a final volume of 100 microliters. 
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Upon completion of thermal cycling reactions, each sample of amplified 
DNA was purified using the Qiaquick Spin PCR purification kit (Qiagen, 
Gaithersburg, MD). All amplified DNA samples were subjected to digestion with 
the restriction endonucleases, e.g., Ncol and EcoRI (New England BioLabs, 
5 Beverly, MA) (Ausubel et al, Current Protocols in Molecular Biology, John Wiley 
& Sons, Inc. (1994)). DNA samples were then subjected to electrophoresis on 1.0% 
NuSeive (FMC BioProducts, Rockland, ME) agarose gels. DNA was visualized by 
exposure to ethidium bromide and long wave UV irradiation. DNA contained in 
slices isolated from the agarose gel was purified using the Bio 101 GeneClean Kit 
10 protocol (Bio 101, Vista, CA). 

The pET-28b vector was prepared for cloning by digestion with restriction 
endonucleases, e.g., Ncol and EcoRI (New England BioLabs, Beverly, MA) 
(Ausubel et al, Current Protocols in Molecular Biology, John Wiley & Sons, Inc. 
(1994)). The pET-28a vector, which encodes the histidine affinity tag that can be 
15 fused to the 5 f end of an inserted gene, was prepared by digestion with appropriate 
restriction endonucleases. 

Following digestion, DNA inserts were cloned (Ausubel et al, Current 
Protocols in Molecular Biology, John Wiley & Sons, Inc. (1994)) into the 
previously digested pET-28b expression vector. Products of the ligation reaction 
20 were then used to transform the BL21 strain of E. coli (Ausubel et al, Current 
Protocols in Molecular Biology, John Wiley & Sons, Inc. (1994)) as described 
below. 

Competent bacteria, E. coli strain BL21 or E. coli strain BL21 (DE3), were 
transformed with recombinant pET expression plasmids carrying the cloned HBM 

25 sequence according to standard methods (Ausubel et al, Current Protocols in 
Molecular Biology, John Wiley & Sons, Inc. (1994)). Briefly, 1 jil of ligation 
reaction was mixed with 50 jxl of electrocompetent cells and subjected to a high 
voltage pulse, after which samples were incubated in 0.45 ml SOC medium (0.5% 
yeast extract, 2.0% tryptone, 10 mM NaCl, 2.5 mM KC1, 10 mM MgCl 2 , 10 mM 

30 MgS0 4 and 20 mM glucose) at 37°C with shaking for 1 hour. Samples were then 
spread on LB agar plates containing 25 ng/ml kanamycin sulfate for growth 
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overnight. Transformed colonies of BL21 were then picked and analyzed to 
evaluate cloned inserts, as described below. 

Individual BL21 clones transformed with recombinant pET-28b HBM 
nucleotide sequences were analyzed by PCR amplification of the cloned inserts 
5 using the same forward and reverse primers specific for the HBM sequences that 
were used in the original PCR amplification cloning reactions. Successful 
amplification verifies the integration of the HBM sequence in the expression vector 
(Ausubel et al, Current Protocols in Molecular Biology, John Wiley & Sons, Inc. 
(1994)). 

1 0 Individual clones of recombinant pET-28b vectors carrying properly cloned 

HBM nucleotide sequences were picked and incubated in 5 ml of LB broth plus 25 
|ng/ml kanamycin sulfate overnight. The following day plasmid DNA was isolated 
and purified using the Qiagen plasmid purification protocol (Qiagen Inc., 
Chatsworth, CA). 

15 The pET vector can be propagated in any E. coli K-12 strain, e.g., HMS174, 

HB101, JM109, DH5 and the like, for purposes of cloning or plasmid preparation. 
Hosts for expression include E. coli strains containing a chromosomal copy of the 
gene for T7 RNA polymerase. These hosts were lysogens of bacteriophage DE3, a 
lambda derivative that carries the lad gene, the lacUV5 promoter and the gene for 

20 T7 RNA polymerase. T7 RNA polymerase was induced by addition of isopropyl-p- 
D-thiogalactoside (IPTG), and the T7 RNA polymerase transcribes any target 
plasmid containing a functional T7 promoter, such as pET-28b, carrying its gene of 
interest. Strains include, for example, BL21(DE3) (Studier et al, Meth. EnzymoL, 
185:60-89 (1990)). 

25 To express the recombinant HBM sequence, 50 ng of plasmid DNA are 

isolated as described above to transform competent BL21(DE3) bacteria as 
described above (provided by Novagen as part of the pET expression kit). The lacZ 
gene (P-galactosidase) is expressed in the pET-System as described for the HBM 
recombinant constructions. Transformed cells were cultured in SOC medium for 1 

30 hour, and the culture was then plated on LB plates containing 25 |ag/ml kanamycin 
sulfate. The following day, the bacterial colonies were pooled and grown in LB 
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medium containing kanamycin sulfate (25 ^ig/ml) to an optical density at 600 nM of 
0.5 to 1.0 O.D. units, at which point 1 mM IPTG was added to the culture for 3 
hours to induce gene expression of the HBM recombinant DNA constructions. 

After induction of gene expression with IPTG, bacteria were collected by 
5 centrifugation in a Sorvall RC-3B centrifuge at 3500 x g for 15 minutes at 4°C. 
Pellets were resuspended in 50 ml of cold mM Tris-HCl, pH 8.0, 0.1 M NaCl and 
0. 1 mM EDTA (STE buffer). Cells were then centrifuged at 2000 x g for 20 
minutes at 4°C. Wet pellets were weighed and frozen at -80°C until ready for 
protein purification. 

10 A variety of methodologies known in the art can he used to purify the 

isolated proteins (Coligan et al, Current Protocols in Protein Science, John Wiley & 
Sons (1995)). For example, the frozen cells can be thawed, resuspended in buffer 
and ruptured by several passages through a small volume microfluidizer (Model M- 
1 10S, Microfluidics International Corp., Newton, MA). The resultant homogenate 

15 is centrifuged to yield a clear supernatant (crude extract) and, following filtration, 

the crude extract is fractioned over columns. Fractions are monitored by absorbance 
at OD 280 nm and peak fractions may be analyzed by SDS-PAGE. 

The concentrations of purified protein preparations are quantified 
spectrophotometrically using absorbance coefficients calculated from amino acid 

20 content (Perkins, Eur. J. Biochem., 157:169-180 (1986)). Protein concentrations are 
also measured by the method of Bradford, Anal. Biochem., 72:248-254 (1976) and 
Lowry et al, J. Biol Chem., 193:265-275 (1951) using bovine serum albumin as a 
standard. 

SDS-polyacrylamide gels of various concentrations were purchased from 
25 BioRad (Hercules, CA), and stained with Coomassie blue. Molecular weight 
markers may include rabbit skeletal muscle myosin (200 kDa), E. coli P- 
galactosidase (116 kDa), rabbit muscle phosphorylase B (97.4 kDa), bovine serum 
albumin (66.2 kDa), ovalbumin (45 kDa), bovine carbonic anyhdrase (31 kDa), 
soybean trypsin inhibitor (21.5 kDa), egg white lysozyme (14.4 kDa) and bovine 
30 aprotinin (6.5 kDa). 
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Once a sufficient quantity of the desired protein has been obtained, it may be 
used for various purposes. A typical use is the production of antibodies specific for 
binding. These antibodies may be either polyclonal or monoclonal, and may be 
produced by in vitro or in vivo techniques well known in the art. Monoclonal 
5 antibodies to epitopes of any of the peptides identified and isolated as described can 
be prepared from murine hybridomas (Kohler, Nature, 256:495 (1975)). In 
summary, a mouse is inoculated with a few micrograms of HBM protein over a 
period of two weeks. The mouse is then sacrificed. The cells that produce 
antibodies are then removed from the mouse's spleen. The spleen cells are then 

10 fused with polyethylene glycol with mouse myeloma cells. The successfully fused 
cells are diluted in a microtiter plate and growth of the culture is continued. The 
amount of antibody per well is measured by immunoassay methods such as ELISA 
(Engvall, Meth. Enzymol, 70:419 (1980)). Clones producing antibody can be 
expanded and further propagated to produce HBM antibodies. Other suitable 

15 techniques involve in vitro exposure of lymphocytes to the antigenic polypeptides, 
or alternatively, to selection of libraries of antibodies in phage or similar vectors. 
See Huse et al, Science, 246:1275-1281 (1989). For additional information on 
antibody production see Davis et al, Basic Methods in Molecular Biology, Elsevier, 
NY, Section 21-2 (1989). 

20 XIX. Methods of Use: Gene Therapy 

In recent years, significant technological advances have been made in the 
area of gene therapy for both genetic and acquired diseases. (Kay et al, Proa Natl. 
Acad. Sci. USA, 94:12744-12746 (1997)) Gene therapy can be defined as the 
deliberate transfer of DNA for therapeutic purposes. Improvement in gene transfer 

25 methods has allowed for development of gene therapy protocols for the treatment of 
diverse types of diseases. Gene therapy has also taken advantage of recent advances 
in the identification of new therapeutic genes, improvement in both viral and 
nonviral gene delivery systems, better understanding of gene regulation, and 
improvement in cell isolation and transplantation. 

30 The preceding experiments identify the HBM gene as a dominant mutation 

conferring elevated bone mass. The fact that this mutation is dominant indicates that 
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expression of the HBM protein causes elevated bone mass. Older individuals 
carrying the HBM gene, and, therefore expressing the HBM protein, do not suffer 
from osteoporosis. These individuals are equivalent to individuals being treated 
with the HBM protein. These observations are a strong experimental indication that 
5 therapeutic treatment with the HBM protein prevents osteoporosis. The bone mass 
elevating activity of the HBM gene is termed "HBM function." 

Therefore, according to the present invention, a method is also provided of 
supplying HBM function to mesenchymal stem cells (Onyia et al, J. Bone Miner, 
Res., 13:20-30 (1998); Ko et al, Cancer Res., 56:4614-4619 (1996)). Supplying 

10 such a function provides protection against osteoporosis. The HBM gene or a part 
of the gene may be introduced into the cell in a vector such that the gene remains 
extrachromosomal. In such a situation, the gene will be expressed by the cell from 
the extrachromosomal location. 

Vectors for introduction of genes both for recombination and for 

15 extrachromosomal maintenance are known in the art, and any suitable vector may be 
used. Methods for introducing DNA into cells such as electroporation, calcium 
phosphate co-precipitation, and viral transduction are known in the art, and the 
choice of method is within the competence of one skilled in the art (Robbins, Ed., 
Gene Therapy Protocols, Human Press, NJ (1997)). Cells transformed with the 

20 HBM gene can be used as model systems to study osteoporosis and drug treatments 
that promote bone growth. 

As generally discussed above, the HBM gene or fragment, where applicable, 
may be used in gene therapy methods in order to increase the amount of the 
expression products of such genes in mesenchymal stem cells. It may be useful also 

25 to increase the level of expression of a given HBM protein, or a fragment thereof, 

even in those cells in which the wild type gene is expressed normally. Gene therapy 
would be carried out according to generally accepted methods as described by, for 
example, Friedman, Tlierapy for Genetic Diseases, Friedman, Ed., Oxford 
University Press, pages 105-121 (1991). 

30 A virus or plasmid vector containing a copy of the HBM gene linked to 

expression control elements and capable of replicating inside mesenchymal stem 
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cells, is prepared. Suitable vectors are known and described, for example, in U.S. 
Patent No. 5,252,479 and WO 93/07282, the disclosures of which are incorporated 
by reference herein in their entirety. The vector is then injected into the patient, 
either locally into the bone marrow or systemically (in order to reach any 
5 mesenchymal stem cells located at other sites, i.e., in the blood). If the transfected 
gene is not permanently incorporated into the genome of each of the targeted cells, 
the treatment may have to be repeated periodically. 

Gene transfer systems known in the art may be useful in the practice of the 
gene therapy methods of the present invention. These include viral and non- viral 

1 0 transfer methods. A number of viruses have been used as gene transfer vectors, 

including polyoma, i.e., SV40 (Madzak et al, J. Gen. Virol, 73:1533-1536 (1992)), 
adenovirus (Berkner, Curr. Top. Microbiol Immunol, 158:39-61 (1992); Berkner et 
al, Bio Techniques, 6:616-629 (1988); Gorziglia et al, J. Virol, 66:4407-4412 
(1992); Quantin et al, Proc. Natl Acad. Sci. USA, 89:2581-2584 (1992); Rosenfeld 

15 et al, Cell, 68:143-155 (1992); Wilkinson et al, Nucl Acids Res., 20:2233-2239 

(1992); Stratford-Penicaudet et al, Hum. Gene Titer., 1:241-256 (1990)), vaccinia 
virus (Mackett et al, Biotechnology, 24:495-499 (1992)), adeno-associated virus 
(Muzyczka, Curr. Top. Microbiol Immunol, 158:91-123 (1992); Ohi et al, Gene, 
89:279-282 (1990)), herpes viruses including HSV and EBV (Margolskee, Curr. 

20 Top. Microbiol Immunol, 158:67-90 (1992); Johnson et al, J. Virol, 66:2952-2965 
(1992); Fink et al, Hum. Gene Titer., 3:11-19 (1992); Breakfield et al, Mol 
Neurobiol, 1:337-371 (1987;) Fresse et al, Biochem. Pharmacol, 40:2189-2199 
(1990)), and retroviruses of avian (Brandyopadhyay et al, Mol Cell Biol, 4:7 '49-7 '54 

(1984) ; Petropouplos et al, J. Virol, 66:3391-3397 (1992)), murine (Miller, Curr. 
25 Top. Microbiol. Immunol. , 1 5 8 : 1 -24 (1 992); Miller et al, Mol Cell Biol. , 5 :43 1 -437 

(1985) ; Sorge et al, Mol. Cell Biol, 4:1730-1737 (1984); Mann et al, J. Virol, 
54:401-407 (1985)), and human origin (Page et al, J. Virol, 64:5370-5276 (1990); 
Buchschalcher et al, J. Virol, 66:2731-2739 (1992)). Most human gene therapy 
protocols have been based on disabled murine retroviruses. 

30 Non- viral gene transfer methods known in the art include chemical 

techniques such as calcium phosphate coprecipitation (Graham et al, Virology, 
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52:456-467 (1973); Pellicer et al, Science, 209:1414-1422 (1980)), mechanical 
techniques, for example microinjection (Anderson et al, Proc. Natl. Acad, ScL USA, 
77:5399-5403 (1980); Gordon et al, Proc. Natl. Acad. ScL USA, 77:7380-7384 
(1980); Brinster et al, Cell, 27:223-231 (1981); Constantini et al, Nature, 294:92-94 
5 (1981)), membrane fusion-mediated transfer via liposomes (Feigner et al, Proc. 
Natl Acad. Sci. USA, 84:7413-7417 (1987); Wang et al, Biochemistry, 28:9508- 
9514 (1989); Kaneda et al, J. Biol. Chem., 264:12126-12129 (1989); Stewart et al, 
Hum. Gene Ther., 3:267-275 (1992); Nabel et al, Science, 249:1285-1288 (1990); 
Lim et al, Circulation, 83:2007-2011 (1992)), and direct DNA uptake and receptor- 

10 mediated DNA transfer (Wolff et al, Science, 247:1465-1468 (1990); Wu et al, 
BioTechniques, 11:474-485 (1991); Zenke et al, Proc. Natl Acad. Sci. USA, 
87:3655-3659 (1990); Wu et al, J. Biol Chem., 264:16985-16987 (1989); Wolff et 
al, BioTechniques, 11:474-485 (1991); Wagner et al, 1990; Wagner et al, Proc. 
Natl Acad. Sci. USA, 88:4255-4259 (1991); Cotten et al, Proc. Natl Acad. Sci. 

15 USA, 87:4033-4037 (1990); Curiel et al, Proc. Natl Acad. Sci. USA, 88:8850-8854 
(1991); Curiel et al, Hum. Gene Titer., 3:147-154 (1991)). Viral-mediated gene 
transfer can be combined with direct in vivo vectors to the mesenchymal stem cells 
and not into the surrounding cells (Romano et al, In Vivo, 12(l):59-67 (1998); 
Gonez et al, Hum. Mol Genetics, 7(12):1913-9 (1998)). Alternatively, the retroviral 

20 vector producer cell line can be injected into the bone marrow (Culver et al, Science, 
256:1550-1552(1 992)) . Inj ection of producer cells would then provide a continuous 
source of vector particles. This technique has been approved for use in humans with 
inoperable brain tumors. 

In an approach which combines biological and physical gene transfer 

25 methods, plasmid DNA of any size is combined with a polylysine-conjugated 
antibody specific to the adenovirus hexon protein, and the resulting complex is 
bound to an adenovirus vector. The trimolecular complex is then used to infect 
cells. The adenovirus vector permits efficient binding, internalization, and 
degradation of the endosome before the coupled DNA is damaged. 

30 Liposome/DNA complexes have been shown to be capable of mediating 

direct in vivo gene transfer. While in standard liposome preparations the gene 
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transfer process is non-specific, localized in vivo uptake and expression have been 
reported in tumor deposits, for example, following direct in situ administration 
(Nabel, Hum. Gene Titer., 3:399-410 (1992)). 

XX, Methods of Use: Transformed Hosts, Development of Pharmaceuticals 
5 and Research Tools 

Cells and animals that carry the HBM gene can be used as model systems to 
study and test for substances that have potential as therapeutic agents (Onyia et al, J. 
Bone Miner. Res., 13:20-30 (1998); Broder et al, Bone, 21:225-235 (1997)). The 
cells are typically cultured mesenchymal stem cells. These may be isolated from 

10 individuals with somatic or germline HBM genes. Alternatively, the cell line can be 
engineered to cany the HBM gene, as described above. After a test substance is 
applied to the cells, the transformed phenotype of the cell is determined. Any trait 
of transformed cells can be assessed, including formation of bone matrix in culture 
(Broder et al, Bone, 21:225-235 (1997)), mechanical properties (Kizer et al, Proc. 

15 Natl Acad. Sci. USA, 94:1013-1018 (1997)), and response to application of putative 
therapeutic agents. 

Animals for testing therapeutic agents can be selected after treatment of 
germline cells or zygotes. Such treatments include insertion of the Zmaxl gene, as 
well as insertion of the HBM gene and disrupted homologous genes. Alternatively, 

20 the inserted Zmaxl gene(s) and/or HBM gene(s) of the animals may be disrupted by 
insertion or deletion mutation of other genetic alterations using conventional 
techniques, such as those described by, for example, Capechi, Science, 244:1288 
(1989); Valancuis et al, Mol. Cell Biol, 11:1402 (1991); Hasty et al, Nature, 
350:243 (1991); Shinkai et al, Cell, 68:855 (1992); Mombaerts et al, Cell, 68:869 

25 (1992); Philpott et al, Science, 256:1448 (1992); Snouwaert et al, Science, 257:1083 
(1992); Donehower et al, Nature, 356:215 (1992). After test substances have been 
administered to the animals, the growth of bone must be assessed. If the test 
substance enhances the growth of bone, then the test substance is a candidate 
therapeutic agent. These animal models provide an extremely important vehicle for 

30 potential therapeutic products. 
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Individuals carrying the HBM gene have elevated bone mass. The HBM 
gene causes this phenotype by altering the activities, levels, expression patterns, and 
modification states of other molecules involved in bone development. Using a 
variety of established techniques, it is possible to identify molecules, preferably 
5 proteins or mRNAs, whose activities, levels, expression patterns, and modification 
states are different between systems containing the Zmax 1 gene and systems 
containing the HBM gene. Such systems can be, for example, cell-free extracts, 
cells, tissues or living organisms, such as mice or humans. For a mutant fomi of 
Zmaxl, a complete deletion of Zmaxl, mutations lacking the extracellular or 

10 intracellular portion of the protein, or any other mutation in the Zmaxl gene may be 
used. It is also possible to use expression of antisense Zmaxl RNA or 
oligonucleotides to inhibit production of the Zmaxl protein. For a mutant form of 
HBM, a complete deletion of HBM, mutations lacking the extracellular or 
intracellular portion of the HBM protein, or any other mutation in the HBM gene 

15 may be used. It is also possible to use expression of antisense HBM RNA or 
oligonucleotides to inhibit production of the HBM protein. 

Molecules identified by comparison of Zmaxl systems and HBM systems 
can be used as surrogate markers in pharmaceutical development or in diagnosis of 
human or animal bone disease. Alternatively, such molecules may be used in 

20 treatment of bone disease. See, Schena et al, Science, 270:467-470 (1995). 

For example, a transgenic mouse carrying the HBM gene in the mouse 
homologue is constructed. A mouse of the genotype HBM/+ is viable, healthy and 
has elevated bone mass. To identify surrogate markers for elevated bone mass, 
HBM/+ (i.e., heterozygous) and isogenic +/+ (i.e., wild-type) mice are sacrificed. 

25 Bone tissue mRNA is extracted from each animal, and a "gene chip" corresponding 
to mRNAs expressed in the +/+ individual is constructed. mRNA from different 
tissues is isolated from animals of each genotype, reverse-transcribed, fluorescently 
labeled, and then hybridized to gene fragments affixed to a solid support. The ratio 
of fluorescent intensity between the two populations is indicative of the relative 

30 abundance of the specific mRNAs in the +/+ and HBM/+ animals. Genes encoding 
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mRNAs over- and under-expressed relative to the wild-type control are candidates 
for genes coordinately regulated by the HBM gene. 

One standard procedure for identification of new proteins that are part of the 
same signaling cascade as an already-discovered protein is as follows. Cells are 
5 treated with radioactive phosphorous, and the already-discovered protein is 

manipulated to be more ore less active. The phosphorylation state of other proteins 
in the cell is then monitored by polyacrylamide gel electrophoresis and 
autoradiography, or similar techniques. Levels of activity of the known protein may 
be manipulated by many methods, including, for example, comparing wild-type 

10 mutant proteins using specific inhibitors such as drugs or antibodies, simply adding 
or not adding a known extracellular protein, or using antisense inhibition of the : 
expression of the known protein (Tamura et al, Science, 280(5369): 1614-7 (1998); 
Meng, EMBOJ., 17(15):4391-403 (1998); Cooper et al, Cell 1:263-73 (1982)). 
In another example, proteins with different levels of phosphorylation are 

15 identified in TE85 osteosarcoma cells expressing either a sense or antisense cDNA 
for Zmaxl . TE85 cells normally express high levels of Zmaxl (Dong et al, 
Biochem. & Biophys. Res. Comm., 251:784-790 (1998)). Cells containing the sense 
construct express even higher levels of Zmaxl, while cells expressing the antisense 
construct express lower levels. Cells are grown in the presence of 32 P, harvested, 

20 lysed, and the lysates run on SDS polyacrylamide gels to separate proteins, and the 
gels subjected to autoradiography (Ausubel et al, Current Protocols in Molecular 
Biology, John Wiley & Sons (1997)). Bands that differ in intensity between the 
sense and antisense cell lines represent phosphoproteins whose phosphorylation state 
or absolute level varies in response to levels of Zmaxl. As an alternative to the 32 P- 

25 labeling, unlabeled proteins may be separated by SDS-PAGE and subjected to 

immunoblotting, using the commercially available anti-phosphotyrosine antibody as 
a probe (Thomas et al, Nature, 376(6537):267-71 (1995)). As an alternative to the 
expression of antisense RNA, transfection with chemically modified antisense 
oligonucleotides can be used (Woolf et al, Nucleic Acids Res., 18(7):1763-9 (1990)). 

30 Many bone disorders, such as osteoporosis, have a slow onset and a slow 

response to treatment. It is therefore useful to develop surrogate markers for bone 
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development and mineralization. Such markers can be useful in developing 
treatments for bone disorders, and for diagnosing patients who may be at risk for 
later development of bone disorders. Examples of preferred markers are N- and C- 
terminal telopeptide markers described, for example, in U.S. Patent Nos. 5,455,179, 
5 5,641,837 and 5,652,1 12, the disclosures of which are incorporated by reference 
herein in their entirety. In the area of HIV disease, CD4 counts and viral load are 
useful surrogate markers for disease progression (Vlahov et al, JAMA, 279(l):35-40 
(1998)). There is a need for analogous surrogate markers in the area of bone disease. 
A surrogate marker can be any characteristic that is easily tested and 
10 relatively insensitive to non-specific influences. For example, a surrogate marker 
can be a molecule such as a protein or mKNA in a tissue or in blood serum. 
Alternatively, a surrogate marker may be a diagnostic sign such as sensitivity to 
pain, a reflex response or the like. 

In yet another example, surrogate markers for elevated bone mass are 
15 identified using a pedigree of humans carrying the HBM gene. Blood samples are 
withdrawn from three individuals that carry the HBM gene, and from three closely 
related individuals that do not. Proteins in the serum from these individuals are 
electrophoresed on a two dimensional gel system, in which one dimension separates 
proteins by size, and another dimension separates proteins by isoelectric point 
20 (Epstein et al, Electrophoresis, 17(1 1): 1655-70 (1996)). Spots corresponding to 

proteins are identified. A few spots are expected to be present in different amounts 
or in slightly different positions for the HBM individuals compared to their normal 
relatives. These spots correspond to proteins that are candidate surrogate markers. 
The identities of the proteins are determined by microsequencing, and antibodies to 
25 the proteins can be produced by standard methods for use in diagnostic testing 
procedures. Diagnostic assays for HBM proteins or other candidate surrogate 
markers include using antibodies described in this invention and a reporter molecule 
to detect HBM in human body fluids, membranes, bones, cells, tissues or extracts 
thereof. The antibodies can be labeled by joining them covalently or noncovalently 
30 with a substance that provides a detectable signal. In many scientific and patent 
literature, a variety of reporter molecules or labels are described including 
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radionuclides, enzymes, fluorescent, chemi-Iuminescent or chromogenic agents 
(U.S. Patent Nos. 3,817,837; 3,850,752; 3,939,350; 3,996,345; 4,277,437; 
4,275,149; and 4,366,241). 

Using these antibodies, the levels of candidate surrogate markers are 
5 measured in normal individuals and in patients suffering from a bone disorder, such 
as osteoporosis, osteoporosis pseudoglioma, Engelmann's disease, Ribbing's disease, 
hyperphosphatasemia, Van Buchem's disease, melorheostosis, osteopetrosis, 
pychodysostosis, sclerosteosis, osteopoikilosis, acromegaly, Paget's disease, fibrous 
dysplasia, tubular stenosis, osteogenesis imperfecta, hypoparathyroidism, 

10 pseudohypoparathyroidism, pseudopseudohypoparathyroidism, primary and 

secondary hyperparathyroidism and associated syndromes, hypercalciuria, medullary 
carcinoma of the thyroid gland, osteomalacia and other diseases. Techniques for 
measuring levels of protein in serum in a clinical setting using antibodies are well 
established. A protein that is consistently present in higher or lower levels in 

15 individuals carrying a particular disease or type of disease is a useful surrogate 
marker. 

A surrogate marker can be used in diagnosis of a bone disorder. For 
example, consider a child that present to a physician with a high frequency of bone 
fracture. The underlying cause may be child abuse, inappropriate behavior by the 
20 child, or a bone disorder. To rapidly test for a bone disorder, the levels of the 
surrogate marker protein are measured using the antibody described above. 

Levels of modification states of surrogate markers can be measured as 
indicators of the likely effectiveness of a drug that is being developed. It is 
especially convenient to use surrogate markers in creating treatments for bone 
25 disorders, because alterations in bone development or mineralization may require a 
long time to be observed. For example, a set of bone mRNAs, termed the "HBM- 
inducible mRNA set" is found to be overexpressed in HBM/+ mice as compared to 
+/+ mice, as described above. Expression of this set can be used as a surrogate 
marker. Specifically, if treatment of +/+ mice with a compound results in 
30 overexpression of the HBM-inducible mRNA set, then that compound is considered 
a promising candidate for further development. 
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This invention is particularly useful for screening compounds by using the 
Zmaxl or HBM protein or binding fragment thereof in any of a variety of drug 
screening techniques. 

The Zmaxl or HBM protein or fragment employed in such a test may either 
5 be free in solution, affixed to a solid support, or borne on a cell surface. One 

method of drug screening utilizes eukaryotic or prokaryotic host cells which are 
stably transformed with recombinant nucleic acids expressing the protein or 
fragment, preferably in competitive binding assays. Such cells, either in viable or 
fixed form, can be used for standard binding assays. One may measure, for 

10 example, for the formation of complexes between a Zmaxl or HBM protein or 

fragment and the agent being tested, or examine the degree to which the formation 
of a complex between a Zmaxl or HBM protein or fragment and a known ligand is 
interfered with by the agent being tested. 

Thus, the present invention provides methods of screening for drugs 

15 comprising contacting such an agent with a Zmaxl or HBM protein or fragment 
thereof and assaying (i) for the presence of a complex between the agent and the 
Zmaxl or HBM protein or fragment, or (ii) for the presence of a complex between 
the Zmaxl or HBM protein or fragment and a ligand, by methods well known in the 
art. In such competitive binding assays the Zmaxl or HBM protein or fragment is 

20 typically labeled. Free Zmaxl or HBM protein or fragment is separated from that 
present in a protein:protein complex, and the amount of free (i.e., uncomplexed) 
label is a measure of the binding of the agent being tested to Zmaxl or HBM or its 
interference with Zmaxl or HBM: ligand binding, respectively. 

Another technique for drug screening provides high throughput screening for 

25 compounds having suitable binding affinity to the Zmaxl or HBM proteins and is 

described in detail in WO 84/03564. Briefly stated, large numbers of different small 
peptide test compounds are synthesized on a solid substrate, such as plastic pins or 
some other surface. The peptide test compounds are reacted with Zmaxl or HBM 
proteins and washed. Bound Zmaxl or HBM protein is then detected by methods 

30 well known in the art. Purified Zmaxl or HBM can be coated directly onto plates 

for use in the aforementioned drug screening techniques. However, non-neutralizing 
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antibodies to the protein can be used to capture antibodies to immobilize the Zmaxl 
or HBM protein on the solid phase. 

This invention also contemplates the use of competitive drug screening 
assays in which neutralizing antibodies capable of specifically binding the Zmaxl or 
5 HBM protein compete with a test compound for binding to the Zmaxl or HBM 

protein or fragments thereof. In this manner, the antibodies can be used to detect the 
presence of any peptide that shares one or more antigenic determinants of the Zmaxl 
or HBM protein. 

A further technique for drug screening involves the use of host eukaryotic 

10 cell lines or cells (such as described above) that have a nonfunctional Zmaxl or 
HBM gene. These host cell lines or cells are defective at the Zmaxl or HBM 
protein level. The host cell lines or cells are grown in the presence of drug 
compound. The rate of growth of the host cells is measured to determine if the 
compound is capable of regulating the growth of Zmaxl or HBM defective cells. 

15 The goal of rational drug design is to produce structural analogs of 

biologically active proteins of interest or of small molecules with which they interact 
(e.g., agonists, antagonists, inhibitors) in order to fashion drugs which are, for 
example, more active or stable forms of the protein, or which, e.g., enhance or 
interfere with the function of a protein in vivo. See, e.g., Hodgson, Bio/Technology, 

20 9:19-21 (1991). In one approach, one first determines the three-dimensional 

structure of a protein of interest (e.g., Zmaxl or HBM protein) or, for example, of 
the Zmaxl- or HBM-receptor or ligand complex, by x-ray crystallography, by 
computer modeling or most typically, by a combination of approaches. Less often, 
useful information regarding the structure of a protein may be gained by modeling 

25 based on the structure of homologous proteins. An example of rational drug design 
is the development of HIV protease inhibitors (Erickson et al, Science, 249:527-533 
(1990)). In addition, peptides (e.g., Zmaxl or HBM protein) are analyzed by an 
alanine scan (Wells, Methods in EnzymoL, 202: 390-41 1 (1991)). In this technique, 
an amino acid residue is replaced by Ala, and its effect on the peptide's activity is 

30 determined. Each of the amino acid residues of the peptide is analyzed in this 
manner to determine the important regions of the peptide. 
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It is also possible to isolate a target-specific antibody, selected by a 
functional assay, and then to solve its crystal structure. In principle, this approach 
yields a pharmacore upon which subsequent drug design can be based. It is possible 
to bypass protein crystallography altogether by generating anti-idiotypic antibodies 
5 (anti-ids) to a functional, pharmacologically active antibody. As a mirror image of a 
mirror image, the binding site of the anti-ids would be expected to be an analog of 
the original receptor. The anti-id could then be used to identify and isolate peptides 
from banks of chemically or biologically produced banks of peptides. Selected 
peptides would then act as the pharmacore. 

10 Thus, one may design drugs which have, e.g., improved Zmaxl or HBM 

protein activity or stability or which act as inhibitors, agonists, antagonists, etc. of 
Zmaxl or HBM protein activity. By virtue of the availability of cloned Zmaxl or 
HBM sequences, sufficient amounts of the Zmaxl or HBM protein may be made 
available to perform such analytical studies as x-ray crystallography. In addition, 

15 the knowledge of the Zmaxl or HBM protein sequence provided herein will guide 
those employing computer modeling techniques in place of, or in addition to x-ray 
crystallography. 

XXI. Methods of Use: Avian and Mammalian Animal Husbandry 

The Zmaxl DNA and Zmaxl protein and/or the HBM DNA and HBM 
20 protein can be used for vertebrate and preferably human therapeutic agents and for 
avian and mammalian veterinary agents, including for livestock breeding. Birds, 
including, for example, chickens, roosters, hens, turkeys, ostriches, ducks, pheasants 
and quails, can benefit from the identification of the gene and pathway for high bone 
mass. In many examples cited in literature (for example, McCoy et al, Res. Vet. Set, 
25 60(2): 185-186 (1996)), weakened bones due to husbandry conditions cause cage 

layer fatigue, osteoporosis and high mortality rates. Additional therapeutic agents to 
treat osteoporosis or other bone disorders in birds can have considerable beneficial 
effects on avian welfare and the economic conditions of the livestock industry, 
including, for example, meat and egg production. 
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XXII. Methods of use: Diagnostic assays using Zmaxl-specific 
oligonucleotides for detection of genetic alterations affecting bone development 

In cases where an alteration or disease of bone development is suspected to 
involve an alteration of the Zmaxl gene or the HBM gene, specific oligonucleotides 
5 may be constructed and used to assess the level of Zmaxl mRNA or HBM mRNA, 
respectively, in bone tissue or in another tissue that affects bone development. 

For example, to test whether a person has the HBM gene, which affects bone 
density, polymerase chain reaction can be used. Two oligonucleotides are 
synthesized by standard methods or are obtained from a commercial supplier of 

1 0 custom-made oligonucleotides. The length and base composition are determined by 
standard criteria using the Oligo 4.0 primer Picking program (Wojchich Rychlik, 
1992). One of the oligonucleotides is designed so that it will hybridize only to 
HBM DNA under the PCR conditions used. The other oligonucleotide is designed . 
to hybridize a segment of Zmaxl genomic DNA such that amplification of DNA 

1 5 using these oligonucleotide primers produces a conveniently identified DNA 

fragment. For example, the pair of primers CCAAGTTCTGAGAAGTCC (SEQ ID r. 
NO:32) and AATACCTGAAACCATACCTG (SEQ ID NO:33) will amplify a 530 
base pair DNA fragment from a DNA sample when the following conditions are 
used: step 1 at 95 °C for 120 seconds; step 2 at 95 °C for 30 seconds; step 3 at 58 °C 

20 for 30 seconds; step 4 at 72 °C for 120 seconds; where steps 2-4 are repeated 35 
times. Tissue samples may be obtained from hair follicles, whole blood, or the 
buccal cavity. 

The fragment generated by the above procedure is sequenced by standard 
techniques. Individuals heterozygous for the HBM gene will show an equal amount 

25 of G and T at the second position in the codon for glycine 171. Normal or 
homozygous wild-type individuals will show only G at this position. 
Other amplification techniques besides PCR may be used as alternatives, such as 
ligation-mediated PCR or techniques involving Q-beta replicase (Cahill et al, Clin. 
Chem., 37(9):1482-5 (1991)). For example, the oligonucleotides AGCTGCTCGT 

30 AGCTG TCTCTCCCTGGATCACGGGTACATGTACTGGACAGACTGGGT 

(SEQ ID NO:34) and TGAGACGCCCCGGATTGAGCGGGCAGGGATAGCTTA 
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TTCCCTGTGCCGCATTACGGC (SEQ ID NO:35) can be hybridized to a 
denatured human DNA sample, treated with a DNA ligase, and then subjected to 
PCR amplification using the primer oligonucleotides AGCTGCTCGTAGCTGTCT 
CTCCCTGGA (SEQ ID NO:36) and GCCGTAATGCGGCACAGGGAATAAGCT 
5 (SEQ ID NO:37). In the first two oligonucleotides, the outer 27 bases are random 

sequence corresponding to primer binding sites, and the inner 30 bases correspond to 
sequences in the Zmaxl gene. The T at the end of the first oligonucleotide 
corresponds to the HBM gene. The first two oligonucleotides are ligated only when 
hybridized to human DNA carrying the HBM gene, which results in the formation of 

10 an amplifiable 1 14 bp DNA fragment. 

Products of amplification can be detected by agarose gel electrophoresis, 
quantitative hybridization, or equivalent techniques for nucleic acid detection known 
to one skilled in the art of molecular biology (Sambrook et al, Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring, NY (1989)). 

15 Other alterations in the Zmaxl gene or the HBM gene may be diagnosed by 

the same type of amplification-detection procedures, by using oligonucleotides 
designed to identify those alterations. These procedures can be used in animals as 
well as humans to identify alterations in Zmaxl or HBM that affect bone 
development. 

20 Expression of Zmaxl or HBM in bone tissue may be accomplished by fusing 

the cDNA of Zmaxl or HBM, respectively, to a bone-specific promoter in the 
context of a vector for genetically engineering vertebrate cells. DNA constructs are 
introduced into cells by packaging the DNA into virus capsids, by the use of cationic 
liposomes, electroporation, or by calcium phosphate transfection. Transfected cells, 

25 preferably osteoblasts, may be studied in culture or may be introduced into bone 
tissue in animals by direct injection into bone or by intravenous injection of 
osteoblasts, followed by incorporation into bone tissue (Ko et al, Cancer Research, 
56(20):4614-9 (1996)). For example, the osteocalcin promoter, which is specifically 
active in osteoblasts, may be used to direct transcription of the Zmaxl gene or the 

30 HBM gene. Any of several vectors and transfection methods may be used, such as 
retroviral vectors, adenovirus vectors, or vectors that are maintained after 
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transfection using cationic liposomes, or other methods and vectors described 
herein. 

Alteration of the level of functional Zmaxl protein or HBM protein affects 
the level of bone mineralization. By manipulating levels of functional Zmaxl 
protein or HBM protein, it is possible to affect bone development and to increase or 
decrease levels of bone mineralization. For example, it may be useful to increase 
bone mineralization in patients with osteoporosis. Alternatively, it may be useful to 
decrease bone mineralization in patients with osteopetrosis or Paget f s disease. 
Alteration of Zmaxl levels or HBM levels can also be used as a research tool. 
Specifically, it is possible to identify proteins, mRNA and other molecules whose 
level or modification status is altered in response to changes in functional levels of v. 
Zmaxl or HBM. The pathology and pathogenesis of bone disorders is known and 
described, for example, in Rubin and Farber (Eds.), Pathology, 2nd Ed., S.B. 
Lippincott Co., Philadelphia, PA (1994). 

A variety of techniques can be used to alter the levels of functional Zmaxl or 
HBM. For example, intravenous or intraosseous injection of the extracellular 
portion of Zmaxl or mutations thereof, or HBM or mutations thereof, will alter the - 
level of Zmaxl activity or HBM activity, respectively, in the body of the treated 
human, animal or bird. Truncated versions of the Zmaxl protein or HBM protein 
can also be injected to alter the levels of functional Zmaxl protein or HBM protein, 
respectively. Certain forms of Zmaxl or HBM enhance the activity of endogenous 
protein, while other forms are inhibitory. 

In a preferred embodiment, the HBM protein is used to treat osteoporosis. In 
a further preferred embodiment, the extracellular portion of the HBM protein is 
used. This HBM protein may be optionally modified by the addition of a moiety 
that causes the protein to adhere to the surface of cells. The protein is prepared in a 
pharmaceutically acceptable solution and is administered by injection or another 
method that achieves acceptable pharmacokinetics and distribution. 

In a second embodiment of this method, Zmaxl or HBM levels are increased 
or decreased by gene therapy techniques. To increase Zmaxl or HBM levels, 
osteoblasts or another useful cell type are genetically engineered to express high 
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levels of Zmaxl or HBM as described above. Alternatively, to decrease Zmaxl or 
HBM levels, antisense constructs that specifically reduce the level of translatable 
Zmaxl or HBM mRNA can be used. In general, a tissue-nonspecific promoter may 
be used, such as the CMV promoter or another commercially available promoter 
5 found in expression vectors (Wu et al, Toxicol Appl Pharmacol, 141(l):330-9 

(1996)). In a preferred embodiment, a Zmaxl cDNA or its antisense is transcribed 
by a bone-specific promoter, such as the osteocalcin or another promoter, to achieve 
specific expression in bone tissue. In this way, if a Zmaxl -expressing DNA 
construct or HBM-expressing construct is introduced into non-bone tissue, it will not 
10 be expressed. 

In a third embodiment of this method, antibodies against Zmaxl or HBM are 
used to inhibit its function. Such antibodies are identified herein. 

In a fourth embodiment of this method, drugs that inhibit Zmaxl function or 
HBM function are used. Such drugs are described herein and optimized according 
15 to techniques of medicinal chemistry well known to one skilled in the art of 
pharmaceutical development. 

Zmaxl and HBM interact with several proteins, such as ApoE. Molecules 
that inhibit the interaction between Zmaxl or HBM and ApoE or another binding 
partner are expected to alter bone development and mineralization. Such inhibitors 
20 may be useful as drugs in the treatment of osteoporosis, osteopetrosis, or other 
diseases of bone mineralization. Such inhibitors may be low molecular weight 
compounds, proteins or other types of molecules. See, Kim et al, J. Biochem. 
(Tokyo), 124(6): 1072-1076 (1998). 

Inhibitors of the interaction between Zmaxl or HBM and interacting proteins 
25 may be isolated by standard drug-screening techniques. For example, Zmaxl 
protein, (or a fragment thereof) or HBM protein (or a fragment thereof) can be 
immobilized on a solid support such as the base of microtiter well. A second protein 
or protein fragment, such as ApoE is derivatized to aid in detection, for example 
with fluorescein. Iodine, or biotin, then added to the Zmaxl or HBM in the presence 
30 of candidate compounds that may specifically inhibit this protein-protein domain of 
Zmaxl or HBM, respectively, and thus avoid problems associated with its 
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transmembrane segment. Drug screens of this type are well known to one skilled in 
the art of pharmaceutical development. 

Because Zmaxl and HBM are involved in bone development, proteins that 
bind to Zmaxl and HBM are also expected to be involved in bone development. 
Such binding proteins can be identified by standard methods, such as co- 
immunoprecipitation, co-fractionation, or the two-hybrid screen (Ausubel et al, 
Current Protocols in Molecular Biology, John Wiley & Sons (1997)). For example, 
to identify Zmaxl -interacting proteins or HBM-interacting proteins using the two- 
hybrid system, the extracellular domain of Zmaxl or HBM is fused to Lex A and 
expressed for the yeast vector pEG202 (the "bait") and expressed in the yeast strain 
EGY48. The yeast strain is transformed with a "prey" library in the appropriate 
vector, which encodes a galactose-inducible transcription-activation sequence fused 
to candidate interacting proteins. The techniques for initially selecting and 
subsequently verifying interacting proteins by this method are well known to one 
skilled in the art of molecular biology (Ausubel et al, Current Protocols in 
Molecular Biology, John Wiley & Sons (1997)). 

In a preferred embodiment, proteins that interact with HBM, but not Zmaxl, 
are identified using a variation of the above procedure (Xu et al, Proc. Natl Acad. 
Set USA, 94(23): 12473-8 (Nov. 1997)). This variation of the two-hybrid system 
uses two baits, and Zmaxl and HBM are each fused to LexA and TetR, respectively. 
Alternatively, proteins that interact with the HBM but not Zmaxl are also isolated. 
These procedures are well known to one skilled in the art of molecular biology, and 
are a simple variation of standard two-hybrid procedures. 

As an alternative method of isolating Zmaxl or HBM interacting proteins, a 
biochemical approach is used. The Zmaxl protein or a fragment thereof, such as the 
extracellular domain, or the HBM protein or a fragment thereof, such as the 
extracellular domain, is chemically coupled to Sepharose beads. The Zmaxl- or 
HBM-coupled beads are poured into a column. An extract of proteins, such as 
serum proteins, proteins in the supernatant of a bone biopsy, or intracellular proteins 
from gently lysed TE85 osteoblastic cells, is added to the column. Non-specifically 
bound proteins are eluted, the column is washed several times with a low-salt buffer, 
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and then tightly binding proteins are eluted with a high-salt buffer. These are 
candidate proteins that bind to Zmaxl or HBM, and can be tested for specific 
binding by standard tests and control experiments. Sepharose beads used for 
coupling proteins and the methods for performing the coupling are commercially 
5 available (Sigma), and the procedures described here are well known to one skilled 
in the art of protein biochemistry. 

As a variation of the above procedure, proteins that are eluted by high salt 
from the Zmaxl- or HBM-Sepharose column are then added to an HBM-Zmaxl- 
sepharose column. Proteins that flow through without sticking are proteins that bind 
10 to Zmaxl but not to HBM. Alternatively, proteins that bind to the HBM protein and 
not to the Zmaxl protein can be isolated by reversing the order in which the columns 
are used. 

XXHL Method of Use: Transformation-Associated Recombination (TAR) 
Cloning 

15 Essential for the identification of novel allelic variants of Zmaxl is the 

ability to examine the sequence of both copies of the gene in an individual. To 
accomplish this, two "hooks," or regions of significant similarity, are identified 
within the genomic sequence such that they flank the portion of DNA that is to be 
cloned. Most preferably, the first of these hooks is derived from sequences 5 f to the 

20 first exon of interest and the second is derived from sequences 3 ! to the last exon of 
interest. These two "hooks" are cloned into a bacterial/yeast shuttle vector such as 
that described by Larionov et al, Proc. Natl Acad. Set USA, 94:7384-7387 (1997). 
Other similar vector systems may also be used. To recover the entire genomic copy 
of the Zmaxl gene, the plasmid containing the two "hooks" is linearized with a 

25 restriction endonuclease or is produced by another method such as PCR. This linear 
DNA fragment is introduced into yeast cells along with human genomic DNA. 
Typically, the yeast Saccharomyces cerevisiae is used as a host cell, although 
Larionov et al (in press) have reported using chicken host cells as well. During and 
after the process of transformation, the endogenous host cell converts the linear 

30 plasmid to a circle by a recombination event whereby the region of the human 
genomic DNA homologous to the "hooks" is inserted into the plasmid. This 
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plasmid can be recovered and analyzed by methods well known to one skilled in the 
art. Obviously, the specificity for this reaction requires the host cell machinery to 
recognize sequences similar to the "hooks" present in the linear fragment. However, 
100% sequence identity is not required, as shown by Kouprina et al, Genomics, 
5 53(l):21-28 (October 1998), where the author describes using degenerate repeated 
sequences common in the human genome to recover fragments of human DNA from 
a rodent/human hybrid cell line. 

In another example, only one "hook" is required, as described by Larionov et 
al, Proc. Natl. Acad. Sci. USA, 95(8):4469-74 (April 1998). For this type of 

10 experiment, termed "radial TAR cloning," the other region of sequence similarity to 
drive the recombination is derived from a repeated sequence from the genome. In 
this way, regions of DNA adjacent to the Zmaxl gene coding region can be 
recovered and examined for alterations that may affect function. 
XXIV. Methods of Use: Genomic Screening 

15 The use of polymorphic genetic markers linked to the HBM gene or to 

Zmaxl is very useful in predicting susceptibility to osteoporosis or other bone 
diseases. Koller et al, Amer. J. BoneMin. Res., 13:1903-1908 (1998) have 
demonstrated that the use of polymorphic genetic markers is useful for linkage 
analysis. Similarly, the identification of polymorphic genetic markers within the 

20 high bone mass gene will allow the identification of specific allelic variants that are 
in linkage disequilibrium with other genetic lesions that affect bone development. 
Using the DNA sequence from the BACs, a dinucleotide CAn repeat was identified 
and two unique PCR primers that will amplify the genomic DNA containing this 
repeat were designed, as shown below: 

25 B200E21C16_L: GAGAGGCTATATCCCTGGGC (SEQ ID NO:38) 

B200E21C16_R: ACAGCACGTGTTTAAAGGGG (SEQ ID NO:39) 
and used in the genetic mapping study. 

This method has been used successfully by others skilled in the art (e.g., 
Sheffield et al, Genet, 4:1837-1844 (1995); LeBlanc-Straceski et al, Genomics, 

30 19:341-9 (1994); Chen et al, Genomics, 25: 1-8 (1995)). Use of these reagents with 
populations or individuals will predict their risk for osteoporosis. Similarly, single 



BNSDOCID: <WO 0177327A1_L> 



WO 01/77327 



PCT/US00/16951 



-110- 

nucleotide polymorphisms (SNPs), such as those shown in Table 4 above, can be 
used as well to predict risk for developing bone diseases or resistance to 
osteoporosis in the case of the HBM gene. 
XXV. Methods of Use: Modulators of Tissue Calcification 
5 The calcification of tissues in the human body is well documented. Towler 

et al, J. Biol Chem., 273:30427-34 (1998) demonstrated that several proteins known 
to regulate calcification of the developing skull in a model system are expressed in 
calcified aorta. The expression of Msx2, a gene transcribed in osteoprogenitor cells, 
in calcified vascular tissue indicates that genes which are important in bone 

10 development are involved in calcification of other tissues. Treatment with HBM 
protein, agonists or antagonists is likely to ameliorate calcification (such as the 
vasculature, dentin and bone of the skull visera) due to its demonstrated effect on 
bone mineral density. In experimental systems where tissue calcification is 
demonstrated, the over-expression or repression of Zmaxl activity permits the 

15 identification of molecules that are directly regulated by the Zmaxl gene. These 

genes are potential targets for therapeutics aimed at modulating tissue calcification. 
For example, an animal, such as the LDLR -/-, mouse is fed a high fat diet and is 
observed to demonstrate expression of markers of tissue calcification, including 
Zmaxl . These animals are then treated with antibodies to Zmaxl or HBM protein, 

20 antisense oligonucleotides directed against Zmaxl or HBM cDNA, or with 

compounds known to bind the Zmaxl or HBM protein or its binding partner or 
ligand. RNA or proteins are extracted from the vascular tissue and the relative 
expression levels of the genes expressed in the tissue are determined by methods 
well known in the art. Genes that are regulated in the tissue are potential therapeutic 

25 targets for pharmaceutical development as modulators of tissue calcification. 

The nucleic acids, proteins, peptides, amino acids, small molecules or other 
pharmaceutically useful compounds of the present invention that are to be given to 
an individual may be administered in the form of a composition with a 
pharmaceutically acceptable carrier, excipient or diluent, which are well known in 

30 the art. The individual may be a mammal or a bird, preferably a human, a rat, a 
mouse or bird. Such compositions may be administered to an individual in a 
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pharmaceutically effective amount. The amount administered will vary depending 
on the condition being treated and the patient being treated. The compositions may 
be administered alone or in combination with other treatments. 

EXAMPLES 

5 The present invention is described by reference to the following Examples, 

which are offered by way of illustration and are not intended to limit the invention in 
any manner. Standard techniques well known in the art or the techniques 
specifically described below were utilized. 
Example 1 

10 The propositus was referred by her physicians to the Creighton Osteoporosis 

Center for evaluation of what appeared to be unusually dense bones. She was 18 
years old and came to medical attention two years previous because of back pain, 
which was precipitated by an auto accident in which the car in which she was riding 
as a passenger was struck from behind. Her only injury was soft tissue injury to her 

1 5 lower back that was manifested by pain and muscle tenderness. There was no 

evidence of fracture or subluxation on radiographs. The pain lasted for two years, 
although she was able to attend school full time. By the time she was seen in the 
Center, the pain was nearly resolved and she was back to her usual activities as a 
high school student. Physical exam revealed a normal healthy young woman 

20 standing 66 inches and weighing 128 pounds. Radiographs of the entire skeleton 
revealed dense looking bones with thick cortices. All bones of the skeleton were 
involved. Most importantly, the shapes of all the bones were entirely normal. The 
spinal BMC was 94.48 grams in LI -4, and the spinal BMD was 1.667 gm/cm 2 in 
Ll-4. BMD was 5.62 standard deviations (SD) above peak skeletal mass for 

25 women. These were measured by DXA using a Hologic 2000~. Her mother was 
then scanned and a lumbar spinal BMC of 58.05 grams and BMD of 1 .500 gm/cm 2 
were found. Her mother's values place her 4.12 SD above peak mass and 4.98 SD 
above her peers. Her mother was 51 years old, stood 65 inches and weighed 140 
pounds. Her mother was in excellent health with no history of musculoskeletal or 

30 other symptoms. Her father's lumbar BMC was 75.33 grams and his BMD was 
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1.118 gm/cm 2 . These values place him 0.25 SD above peak bone mass for males. 
He was in good health, stood 72 inches tall, and weighed 187 pounds. 

These clinical data suggested that the propositus inherited a trait from her 
mother, which resulted in very high bone mass, but an otherwise normal skeleton, 
5 and attention was focused on the maternal kindred. In U.S. Patent No. 5,691,153, 
twenty- two of these members had measurement of bone mass by DXA. In one 
case, the maternal grandfather of the propositus, was deceased, however, medical 
records, antemortem skeletal radiographs and a gall bladder specimen embedded in 
paraffin for DNA genotyping were obtained. His radiographs showed obvious 

10 extreme density of all of the bones available for examination including the femur 
and the spine, and he was included among the affected members. In this invention, 
the pedigree has been expanded to include 37 informative individuals. These 
additions are a significant improvement over the original kinship (Johnson et al, Am. 
J. Hum. Genet, 60:1326-1332 (1997)) because, among the fourteen individuals 

15 added since the original study, two individuals harbor key crossovers. X-linkage is 
ruled out by the presence of male-to-male transmission from individual 12 to 14 and 
15. 

Example 2 

The present invention describes DNA sequences derived from two BAC 
20 clones from the HBM gene region, as evident in Table 7 below, which is an 

assembly of these clones. Clone b200e21-h (ATCC No. 980812; SEQ ID NOS: 10- 
1 1) was deposited at the American Type Culture Collection (ATCC), 10801 
University Blvd., Manassas, VA 201 10-2209 U.S.A., on December 30, 1997. Clone 
b527dl2-h (ATCC No. 980720; SEQ ID NOS: 5-9) was deposited at the American 
25 Type Culture Collection (ATCC), 10801 University Blvd., Manassas, VA 201 10- 
2209 U.S.A., on October 2, 1998. These sequences are unique reagents that can be 
used by one skilled in the art to identify DNA probes for the Zmaxl gene, PCR 
primers to amplify the gene, nucleotide polymorphisms in the Zmaxl gene, or 
regulatory elements of the Zmaxl gene. 
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TABLE7 



Contig 


ATCC No. 


SEQID 
NO. 


Length 
(base pairs) 


b527dl2-h_contig302G 


980720 


5 


3096 


b527dl2-h_contig306G 


980720 


6 


26928 


b527dl2-h_contig307G 


980720 


7 


29430 


b527dl2-h_contig308G 


980720 


8 


33769 


b527dl2-h_contig309G 


980720 


9 


72049 


b200e21-h_contigl 


980812 


10 


8705 


b200e21-h_contig4 


980812 


11 


66933 



10 

The disclosure of each of the patents, patent applications and publications 
cited in the specification is hereby incorporated by reference herein in its entirety. 

Although the invention has been set forth in detail, one skilled in the art will 
recognize that numerous changes and modifications can be made, and that such 
1 5 changes and modifications may be made without departing from the spirit and scope 
of the invention. 

This application claims priority to U.S. Application Nos. 09/543,771 and 
09/544,398 filed on April 5, 2000, which are a continuation-in-part of Application 
No. 09/229,319, filed January 13, 1999, which claims benefit of U.S. Provisional 
20 Application No. 60/071,449, filed January 13, 1998, and U.S. Provisional 
Application No. 60/105,51 1, filed October 23, 1998, all of which are herein 
incorporated by reference in their entirety. 
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CLAIMS 

What is claimed is: 

1 . An isolated nucleic acid sequence of SEQ ID NO: 2. 

2. The isolated nucleic acid sequence of claim 1, wherein the nucleic 
5 acid sequence is DNA. 

3. An isolated amino acid sequence of SEQ ID NO: 4. 

4. A nucleic acid sequence encoding the amino acid sequence of SEQ 
ID NO:4. 

5. A replicative cloning vector comprising the nucleic acid sequence of 
10 claim 1 and a replicon operative in an isolated host cell. 

6. An isolated host cell transformed with the replicative cloning vector 
of claim 5. 

7. An expression vector comprising the nucleic acid sequence of claim 1 
operably linked to a transcription regulatory region. 

15 8 . An isolated host cell transformed with the expression vector of claim 

7. 

9. A method for testing a substance as a therapeutic agent for bone 
modulation in a host comprising administering the nucleic acid of claim 1 to the 
host, and assessing whether bone modulation occurs. 

20 1 0. The method of claim 9, wherein the host is a cell or an animal. 
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15 



20 



11. The method of claim 10, wherein the animal is a human, a rodent or a 

bird. 

12. A method of identifying a molecule involved in bone modulation 
comprising identifying a molecule that binds to, or that inhibits binding of a 
molecule to, HBM. 

13. The method of claim 12, wherein said molecule is a protein. 

14. A method for identifying a protein involved in bone modulation 
comprising identifying a protein that has an expression level that is different in a 
first host comprising the Zmaxl gene when compared to a second host comprising 
the HBM gene. 

15. The method of claim 14, wherein the host is a cell or an animal. 

16. A method of identifying a candidate protein involved in bone 
modulation comprising 

identifying a protein in a first individual having the high bone mass 
phenotype; 

identifying a protein in a second individual not having the high bone mass 
phenotype; 

comparing the protein of the first individual to the protein of the second 
individual, wherein (i) the protein that is present in the first individual but not the 
second individual is the candidate protein or (ii) the protein that is present in a 
higher amount in the first individual than in the second individual is the candidate 
protein or (hi) the protein that is present in a lower amount in the first individual 
than in the second individual is the candidate protein. 

17. The method of claim 16, further comprising producing an antibody to 
the candidate protein. 
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18. A method of identifying a candidate protein involved in bone 
modulation comprising 

identifying a protein in a first individual having the high bone mass 
phenotype; 

5 identifying a protein in a second individual not having the high bone mass 

phenotype; and 

comparing the protein of the first individual to the protein of the second 
individual, wherein (i) the protein that is present in the second individual but not the 
first individual is the candidate protein or (ii) the protein that is present in a higher 
10 amount in the second individual than in the first individual is the candidate protein 
or (iii) the protein that is present in a lower amount in the second individual than in 
the first individual is the candidate protein. 

19. The method of claim 18, further comprising producing an antibody to 
the candidate protein. 

15 20. A method of testing for HBM activity comprising immobilizing an 

HBM protein, binding a protein to the HBM protein, and measuring the extent of 
binding. 

2 1 . The method of claim 20, wherein the protein is ApoE. 

22. A method for identification of a candidate molecule involved in bone 
20 modulation comprising 

identifying a molecule that binds to, or that inhibits binding of a molecule to, 
the nucleic acid sequence of SEQ ID NO: 1; 

identifying a molecule that binds to, or that inhibits binding of a molecule to, 
the nucleic acid sequence of SEQ ID NO: 2; and 
25 comparing the extent of binding, or the extent of inhibition of binding, of the 

molecule to each nucleic acid sequence, wherein the molecule that binds, or inhibits 
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binding, more or less to the nucleic acid sequence of SEQ ID NO: 2 or the nucleic 
acid sequence of SEQ ID NO: 1 is the candidate molecule. 

23. The method of claim 22, wherein the candidate molecule is a protein 
or an mRNA. 

5 24. A method of pharmaceutical development for treatment of bone 

development disorders comprising identifying a molecule that binds to the amino 
acid sequence of SEQ ID NO: 4. 

25. The method of claim 24, wherein the molecule inhibits or enhances 
the function of the amino acid. 

26. A method of pharmaceutical development for treatment of bone 
development disorders comprising 

constructing a first host that contains the Zmaxl gene or protein; 
constructing a second host that contains the HBM gene or protein; 
analyzing a difference between the first host and the second host; 
identifying a molecule that, when added to the first host, causes the first host 
to exhibit a characteristic feature of the second host. 

27. The method of claim 26, wherein the host is a cell-free extract, a cell 
or an animal. 

28. The method of claim 26, wherein the difference is a surrogate marker. 

20 29. A method for treating a bone development disorder in an animal 

comprising transferring the nucleic acid sequence of claim 1 into a somatic cell of an 
animal suffering from a bone development disorder. 

30. The method of claim 29, wherein the animal is a human or a bird. 
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31. A method for treating a bone development disorder in an animal 
comprising transferring the nucleic acid sequence of claim 1 into a germ-line cell of 
an animal suffering from a bone development disorder. 

32. The method of claim 31, wherein the animal is a human or a bird. 

33 . A method of altering bone development in .a host comprising 
administering the amino acid sequence of claim 3 to a somatic cell of a host 
suffering from a bone development disorder. 

34. The method of claim 33, wherein the host is a human or a bird. 

35. A method of altering bone development in a host comprising 
administering the amino acid sequence of claim 3 to a germ-line cell in a host 
suffering from a bone development disorder. 

36. The method of claim 35, wherein the animal is a human or a bird. 

37. A method of treating osteoporosis comprising administering the 
amino acid sequence of claim 3 to a patient in need thereof. 

38. The method of claim 37, wherein the patient is a human or a bird. 

39. A method of treating osteoporosis comprising administering the 
extracellular domain of the amino acid sequence of claim 3 to a patient in need 
thereof. 

40. The method of claim 39, wherein the patient is a human or a bird. 
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41 . A method of treating osteoporosis comprising administering the 
intracellular domain of the amino acid sequence of claim 3 to a patient in need 



thereof. 



42. The method of claim 41, wherein the patient is a human or a bird. 



5 



43. A method for treating bone development disorders comprising 



administering a molecule that binds to the nucleic acid sequence of claim 1 to a 
patient in need thereof. 

44. The method of claim 43, wherein the patient is a human or a bird. 

45. A method for treating bone development disorders comprising 

10 administering an antibody to a patient in need thereof, wherein the antibody is to the 
amino acid sequence of claim 3. 

46. A method for diagnostic screening for a genetic predisposition to a 
bone development disorder comprising screening a sample from a patient with a 
nucleotide sequence derived from the genomic or cDNA nucleic acid sequence of 



47. A diagnostic assay for bone development disorders comprising an 
antibody to the HBM protein. 

48. A method for identifying a genetic predisposition to bone 
development disorders comprising performing a haplotype analysis using the nucleic 



49. A method of expressing the HBM protein in bone tissue comprising 
constructing an expression vector comprising a promoter that directs expression in 
bone tissue operably linked to the nucleic acid sequence of claim 1 . 



15 



HBM. 



20 



acid sequence of claim 1. 
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50. The method of claim 49, wherein the promoter that directs expression 
in bone is an osteocalcin promoter, a bone sialoprotein promoter or an AML-3 
promoter. 

51. A bacterial artificial chromosome having the nucleic acid sequence of 
5 SEQ ID NO: 5, 6, 7, 8, 9, 10 or 1 1. 

52. A method for amplifying a nucleotide polymorphism in the Zmaxl 
gene comprising using the bacterial artificial chromosome of claim 51. 

53. A method for amplifying a nucleotide polymorphism in the HBM 
gene comprising using the bacterial artificial chromosome of claim 51 . 

10 54. A method for identifying a regulatory element of a HBM gene 

comprising using the bacterial artificial chromosome of claim 1 or claim 51. 

« 55. An isolated nucleic acid sequence comprising at least 15 contiguous 

nucleotides of the nucleic acid sequence of SEQ ID NO: 2, wherein one of the at 
least 15 contiguous nucleotides is thymine at position 582. 

15 56. The isolated nucleic acid sequence of claim 55 that is DNA. 

57. The isolated nucleic acid sequence of claim 55 that is RNA. 

58. A replicative cloning vector comprising the nucleic acid sequence of 
claim 55 and a replicon operative in a host cell. 

59. An isolated host cell transformed with the replicative cloning vector 
20 of claim 58. 
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60. An expression vector comprising the nucleic acid sequence of claim 
55 operably linked to a transcription regulatory region. 

6 1 . An isolated host cell transformed with the expression vector of claim 

60. 

5 62. An isolated nucleic acid sequence comprising at least 15 contiguous 

nucleotides of the nucleic acid sequence of SEQ ID NO: 2, wherein one of the at 
least 15 contiguous nucleotides is thymine at position 582, and which encodes for an 
amino acid sequence including a valine corresponding to valine at position 171 of 
SEQ ID NO: 4. 

10 63. The nucleic acid sequence of claim 62 which is DNA. 

64. An isolated nucleic acid segment of at least 15 contiguous 
nucleotides including a polymorphic site from the nucleic acid sequence of SEQ ID 
NO: 2 in which G at position 582 is replaced by T, and sequences complementary 
thereto. 

15 65. The isolated nucleic acid segment of claim 64, wherein said 

complementary sequence is the reverse complement. 

66. The isolated nucleic acid segment of claim 65, wherein said reverse 
complementary sequence is mRNA. 

67. The isolated nucleic acid segment of claim 64 that is DNA. 
20 68. The isolated nucleic acid segment of claim 64 that is cDNA. 

69. The isolated nucleic acid segment of claim 65 that is RNA. 
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70. An isolated nucleic acid segment of at least 15 contiguous 
nucleotides including a single nucleotide polymorphic site from an exon sequence 
selected from the group consisting of: 

SEQ ID NO: 9 wherein nucleotide 69169 is replaced by A, 
SEQ ID NO: 9 wherein nucleotide 27402 is replaced by G, 
SEQ ID NO: 9 wherein nucleotide 27841 is replaced by C, 
SEQ ID NO: 9 wherein nucleotide 35600 is replaced by G, 
SEQ ID NO: 9 wherein nucleotide 45619 is replaced by A, 
SEQ ID NO: 9 wherein nucleotide 46018 is replaced by G, 
SEQ ID NO: 9 wherein nucleotide 46093 is replaced by G, 
SEQ ID NO: 9 wherein nucleotide 46190 is replaced by G, 
SEQ ID NO: 9 wherein nucleotide 50993 is replaced by C, 
SEQ ID NO: 9 wherein nucleotide 51 124 is replaced by T, 
SEQ ID NO: 9 wherein nucleotide 55461 is replaced by T, 
SEQ ID NO: 9 wherein nucleotide 63645 is replaced by A, 
SEQ ID NO: 9 wherein nucleotide 63646 is replaced by C, 
SEQ ID NO: 9 wherein nucleotide 24809 is replaced by G, 
SEQ ID NO: 9 wherein nucleotide 27837 is replaced by C, 
SEQ ID NO: 9 wherein nucleotide 31485 is replaced by T, 
SEQ ID NO: 9 wherein nucleotide 31683 is replaced by G, 
SEQ ID NO: 9 wherein nucleotide 24808 is replaced by G, 
SEQ ID NO: 8 wherein nucleotide 31340 is replaced by C, 
SEQ ID NO: 8 wherein nucleotide 32538 is replaced by G, 
SEQ ID NO: 8 wherein nucleotide 13224 is replaced by G, 
SEQ ID NO: 8 wherein nucleotide 21 1 19 is replaced by A, 
SEQ ID NO: 8 wherein nucleotide 30497 is replaced by A, 
SEQ ID NO: 9 wherein nucleotide 2481 1 is replaced by C, 
SEQ ID NO: 9 wherein nucleotide 68280 is replaced by A, and 
sequences complementary thereto. 
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71 . The isolated nucleic acid segment of claim 70, wherein nucleotide 
21 1 19 of said exon sequence of SEQ ID NO: 8 is replaced by A. 

72. The isolated nucleic acid segment of claim 70 that is DNA. 

73. The isolated nucleic acid segment of claim 70 that is RNA. 

5 74. The isolated nucleic acid segment of claim 64 or claim 70 which is a 

probe or a primer. 

75. A method of identifying a molecule involved in bone modulation 
comprising identifying a molecule that binds to or that inhibits binding of a 

10 molecule to a protein involved in focal adhesion signaling. 

76. The method of claim 75, wherein the molecule involved in focal 
adhesion signaling binds to a protein selected from the group consisting of: SEQ ID 
NO: 87-109. 

77. The method of claim 75, wherein the molecule involved in focal 

1 5 adhesion signaling binds to a protein selected from the group consisting of: SEQ ID 
NO:90, SEQ ID NO:93, SEQ ID NO:94, SEQ ID NO:99 and SEQ ID NO: 102. 

78. A method of modulating bone density in a subject by administering 
an agent that regulates a nucleic acid or polypeptide encoded thereby involved in 
focal adhesion signaling. 

20 79. The method of claim 78, wherein the nucleic acid comprises a nucleic 

acid selected from the group consisting of: SEQ ID NOS: 63-86. 

80. The method of claim 78, wherein the nucleic acid comprises SEQ ID 
NO: 66, SEQ ID NO: 71, SEQ ID NO: 77 or SEQ ID NO: 79. 
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81 . The method of claim 78, wherein the polypeptide is selected from the 
group consisting of: SEQ ID NOS: 87-109. 

82. The method of claim 78, wherein the polypeptide is SEQ ID NO:90, 
SEQ ID NO:93, SEQ ID NO:94, SEQ ID NO:99 or SEQ ID NO: 102. 

5 83. A nucleic acid comprising SEQ ID NO: 66, SEQ ID NO: 71, SEQ ID 

NO: 77 or SEQ ID NO: 79. 



84. A nucleic acid of claim 83, wherein the nucleic acid is RNA or DNA. 



85. A replicative cloning vector comprising a nucleic acid of claim 83 
and a replicon operative in a host cell. 

1 0 86. An isolated host cell transformed with the replicative cloning vector 

of claim 85. 



87. An expression vector comprising the nucleic acid sequence of claim 

83. 



88. An isolated host cell transformed with the expression vector of claim 

15 87. 



89. A polypeptide comprising SEQ ID NO:90, SEQ ID NO:93, SEQ ID 
NO:94, SEQ ID NO:99 or SEQ ID NO: 102. 



90. A nucleic acid encoding a polypeptide selected from the group 
consisting of SEQ ID NO:90, SEQ ID NO:93, SEQ ID NO:94, SEQ ID NO:99 or 
20 SEQ ID NO: 102. 
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91 . A method of treating bone development disorders comprising the step 
of administering an agent which modulates a nucleic acid or a polypeptide involved 
in focal adhesion signaling. 

92 . The method of claim 9 1 , wherein the nucleic acid modulated by the 
5 agent is selected from any one of SEQ ID NOS: 63-86. 

93. The method of claim 91, wherein the polypeptide modulated by the 
agent is selected from any one of SEQ ID NOS: 87-109. 
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Exon 1 

ACTAAAGCGCCGCCGCCGCGCCATGGAGCCCGAGTGAGCGCGGCGCGG 
GCCCGTCCGGCCGCCGGACAACATGGAGGCAGCGCCGCCCGGGCCGCC 
GTGGCCGCTGCTGCTGCTGCTGCTGCTGCTGCTGGCGCTGTGCGGCTGC 
CCGGCCCCCGCCGCGGCC 

Exon 2 Coordinates: 527dl2_Contig308G 30944-30549 

gccccacagCCTCGCCGCTCCTGCTATTTGCCAACCGCCGGGACGTACGGCT 

GGTGGACGCCGGCGGAGTCAAGCTGGAGTCCACCATCGTGGTCAGCGG 

CCTGGAGGATGCGGCCGCAGTGGACTTCCAGTTTTCCAAGGGAGCCGTG 

TACTGGACAGACGTGAGCGAGGAGGCCATCAAGCAGACCTACCTGAACC 

AGACGGGGGCCGCCGTGCAGAACGTGGTCATCTCCGGCCTGGTCTCTCC 

CGACGGCCTCGCCTGCGACTGGGTGGGCAAGAAGCTGTACTGGACGGA 

CTCAGAGACCAACCGCATCGAGGTGGCCAACCTCAATGGCACATCCCGG 

AAGGTGCTCTTCTGGCAGGACCTTGACCAGCCGAGGGCCATCGCCTTGG 

ACCCCGCTCACGGgtaaaccctgctg 

... 9408 nt ... 

Exon 3 Coordinates: 527dl2_Contig308G 21141-20945 

ccccgtcacagGTACATGTACTGGACAGACTGGGfiTGAGACGCCCCGGATTGA 

GCGGGCAGGGATGGATGGCAGCACCCGGAAGATCATTGTGGACTCGGA 

CATTTACTGGCCCAATGGACTGACCATCGACCTGGAGGAGCAGAAGCTC 

TACTGGGCTGACGCCAAGCTCAGCTTCATCCACCGTGCCAACCTGGACG 

GCTCGTTCCGgtaggtacccac 

... 6094 nt ... 

Exon 4 Coordinates: 527dl2_Contig308G 15047-14850 

tccctgactgcagGCAGAAGGTGGTGGAGGGCAGCCTGACGCACCCCTTCGCCC 

TGACGCTCTCCGGGGACACTCTGTACTGGACAGACTGGCAGACCCGCTC 

CATCCATGCCTGCAACAAGCGCACTGGGGGGAAGAGGAAGGAGATCCTG 

AGTGCCCTATACTCACCCATGGACATCCAGGTGCTGAGCCAGGAGCGGC 

AGCCTTTCTgtgagtgccgg 

... 1 827 nt ... 

Exon 5 Coordinates: 527dl2_Contig308G 13220-13088 

tttctcagTCCACACTCGCTGTGAGGAGGACAATGGCGGCTGCTCCCACCTGT 
GCCTGCTGTCCCCAAGCGAGCCTTTCTACACATGCGCCTGCCCCACGGG 
TGTGCAGCTGCAGGACAACGGCAGGACGTGTAAGGCAGgtgaggcggtgggacg 
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... 20923 nt ... 

Exon 6 Coordinates: 527dl2_Contig309G 7705-8100 

ctccacagGAGCCGAGGAGGTGCTGCTGCTGGCCCGGCGGACGGACCTACG 

GAGGATCTCGCTGGACACGCCGGACTTCACCGACATCGTGCTGCAGGTG 

GACGACATCCGGCACGCCATTGCCATCGACTACGACCCGCTAGAGGGCT 

ATGTCTACTGGACAGATGACGAGGTGCGGGCCATCCGCAGGGCGTACCT 

GGACGGGTCTGGGGCGCAGACGCTGGTCAACACCGAGATCAACGACCC 

CGATGGCATCGCGGTCGACTGGGTGGCCCGAAACCTCTACTGGACCGAC 

ACGGGCACGGACCGCATCGAGGTGACGCGCCTCAACGGCACCTCCCGCA 

AGATCCTGGTGTCGGAGGACCTGGACGAGCCCCGAGCCATCGCACTGCA 

CCCCGTGATGGGgtaagacgggc 

3211 nt 

Exon 7 Coordinates: 527dl2_Contig309G 11311-11482 

ttcttctccagCCTCATGTACTGGACAGACTGGGGAGAGAACCCTAAAATCGAG 
TGTGCCAACTTGGATGGGCAGGAGCGGCGTGTGCTGGTCAATGCCTCCC 
TCGGGTGGCCCAACGGCCTGGCCCTGGACCTGCAGGAGGGGAAGCTCT 
ACTGGGGAGACGCCAAGACAGACAAGATCGAGgtgaggctcctgtgg 

13445 nt 

Exon 8 Coordinates: 527dl2_Contig309G 24927-25143 

ccgtcctgcagGTGATCAATGTTGATGGGACGAAGAGGCGGACCCTCCTGGAG 

GACAAGCTCCCGCACATTTTCGGGTTCACGCTGCTGGGGGACTTCATCT 

ACTGGACTGACTGGCAGCGCCGCAGCATCGAGCGGGTGCACAAGGTCAA 

GGCCAGCCGGGACGTCATCATTGACCAGCTGCCCGACCTGATGGGGCTC 

AAAGCTGTGAATGTGGCCAAGGTCGTCGgtgagtccggggggtc 

....2826 nt 

Exon 9 Coordinates: 527dl2_Contig309G 27969-28256 

gttcgcttccagGAACCAACCCGTGTGCGGACAGGAACGGGGGGTGCAGCCACC 

TGTGCTTCTTCACACCCCACGCAACCCGGTGTGGCTGCCCCATCGGCCT 

GGAGCTGCTGAGTGACATGAAGACCTGCATCGTGCCTGAGGCCTTCTTG 

GTCTTCACCAGCAGAGCCGCCATCCACAGGATCTCCCTCGAGACCAATA 

ACAACGACGTGGCCATCCCGCTCACGGGCGTCAAGGAGGCCTCAGCCCT 

GGACTTTGATGTGTCCAACAACCACATCTACTGGACAGACGTCAGCCTG 

AAGgtagcgtgggc 

......3102 

FIG. 3B 
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Exon 10 Coordinates: 527dl2_Contig309G 31358-31582 

cctgctgccagACCATCAGCCGCGCCTTCATGAACGGGAGCTCGGTGGAGCAC 

GTGGTGGAGTTTGGCCTTGACTACCCCGAGGGCATGGCCGTTGACTGGA 

TGGGCAAGAACCTCTACTGGGCCGACACTGGGACCAACAGAATCGAAGT 

GGCGCGGCTGGACGGGCAGTTCCGGCAAGTCCTCGTGTGGAGGGACTT 

GGACAACCCGAGGTCGCTGGCCCTGGATCCCACCAAGGGgtaagtgtttgcctgtc 

1297 nt 

Exon 11 Coordinates: 527dl2_Contig309G 32879-33064 

gtgccttccagCTACATCTACTGGACCGAGTGGGGCGGCAAGCCGAGGATCGT 
GCGGGCCTTCATGGACGGGACCAACTGCATGACGCTGGTGGACAAGGTG 
GGCCGGGCCAACGACCTCACCATTGACTACGCTGACCAGCGCCTCTACT 
GGACCGACCTGGACACCAACATGATCGAGTCGTCCAACATGCTGGgtgaggg 

ccgggct 

2069 nt 

Exon 12 Coordinates: 527dl2_Contig309G 35133-35454 

gtgttcatgcagGTCAGGAGCGGGTCGTGATTGCCGACGATCTCCCGCACCCGT 

TCGGTCTGACGCAGTACAGCGATTATATCTACTGGACAGACTGGAATCT 

GCACAGCATTGAGCGGGCCGACAAGACTAGCGGCCGGAACCGCACCCTC 

ATCCAGGGCCACCTGGACTTCGTGATGGACATCCTGGTGTTCCACTCCT 

CCCGCCAGGATGGCCTCAATGACTGTATGCACAACAACGGGCAGTGTGG 

GCAGCTGTGCCTTGCCATCCCCGGCGGCCACCGCTGCGGCTGCGCCTCA 

CACTACACCCTGGACCCCAGCAGCCGCAACTGCAGCCgtaagtgcctcatggt 

2006 nt 

Exon 13 Coordinates: 527dl2_Contig309G 37460-37659 

gcctcctctaCGCCCACCACCTTCTTGCTGTTCAGCCAGAAATCTGCCATCAGT 

CGGATGATCCCGGACGACCAGCACAGCCCGGATCTCATCCTGCCCCTGC 

ATGGACTGAGGAACGTCAAAGCCATCGACTATGACCCACTGGACAAGTT 

CATCTACTGGGTGGATGGGCGCCAGAACATCAAGCGAGCCAAGGACGAC 

GGGACCCAGgcaggtgccctgtgg 

6965 nt 
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Exon 14 Coordinates: 527dl2_Contig309G 44624-44832 

ctttgtcttacagCCCTTTGTTTTGACCTCTCTGAGCCAAGGCCAAAACCCAGACA 

GGCAGCCCCACGACCTCAGCATCGACATCTACAGCCGGACACTGTTCTG 

GACGTGCGAGGCCACCAATACCATCAACGTCCACAGGCTGAGCGGGGAA 

GCCATGGGGGTGGTGCTGCGTGGGGACCGCGACAAGCCCAGGGCCATC 

GTCGTCAACGCGGAGCGAGGgtaggaggccaac 

1404nt 

Exon 15 Coordinates: 527dl2_Contig309G 46236-46427 

ccaccctcccgcagGTACCTGTACTTCACCAACATGCAGGACCGGGCAGCCAAGA 
TCGAACGCGCAGCCCTGGACGGCACCGAGCGCGAGGTCCTCTTCACCAC 
CGGCCTCATCCGCCCTGTGGCCCTGGTGGTGGACAACACACTGGGCAAG 
CTGTTCTGGGTGGACGCGGACCTGAAGCGCATTGAGAGCTGTGACCTGT 

CAGgtacgcgccccgg 

686 nt 

Exon 16 Coordinates: 527dl2_Contig309G 47113-47322 

ggctgcttgcagGGGCCAACCGCCTGACCCTGGAGGACGCCAACATCGTGCAGC 

CTCTGGGCCTGACCATCCTTGGCAAGCATCTCTACTGGATCGACCGCCA 

GCAGCAGATGATCGAGCGTGTGGAGAAGACCACCGGGGACAAGCGGAC 

TCGCATCCAGGGCCGTGTCGCCCACCTCACTGGCATCCATGCAGTGGAG 

GAAGTCAGCCTGGAGGAGTTCTgtacgtgggggc 

3884 nt 

Exon 17 Coordinates: 527dl2_Contig309G 51206-51331 

ttgtctttgcagCAGCCCACCCATGTGCCCGTGACAATGGTGGCTGCTCCCACAT 
CTGTATTGCCAAGGGTGATGGGACACCACGGTGCTCATGCCCAGTCCAC 
CTCGTGCTCCTGCAGAACCTGCTGACCTGTGGAGgtaggtgtgacctaggtgc 

....3905 nt 

Exon 18 Coordinates: 527dl2_Contig309G 55236-55472 

gttctcctctgtccctcccccagAGCCGCCCACCTGCTCCCCGGACCAGTTTGCATGTG 

CCACAGGGGAGATCGACTGTATCCCCGGGGCCTGGCGCTGTGACGGCTT 

TCCCGAGTGCGATGACCAGAGCGACGAGGAGGGCTGCCCCGTGTGCTCC 

GCCGCCCAGTTCCCCTGCGCGCGGGGTCAGTGTGTGGACCTGCGCCTGC 

GCTGCGACGGCGAGGCAGACTGTCAGGACCGCTCAGACGAGGTGGACT 

GTGACGgtgaggccctcc 

3052 nt 

FIG. 3D 
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Exon 19 Coordinates: 527dl2__Contig309G 58524-58634 

tctccttgcagCCATCTGCCTGCCCAACCAGTTCCGGTGTGCGAGCGGCCAGTG 

TGTCCTCATCAAACAGCAGTGCGACTCCTTCCCCGACTGTATCGACGGCT 
CCGACGAGCTCATGTGTGgtgagccagctt 

1448 nt 



Exon 20 Coordinates: 527dl2_Contig309G 60082-60319 

gtttgtctctggcagAAATCACCAAGCCGCCCTCAGACGACAGCCCGGCCCACAGC 
AGTGCCATCGGGCCCGTCATTGGCATCATCCTCTCTCTCTTCGTCATGGG 
TGGTGTCTATTTTGTGTGCCAGCGCGTGGTGTGCCAGCGCTATGCGGGG 
GCCAACGGGCCCTTCCCGCACGAGTATGTCAGCGGGACCCCGCACGTGC 

CCCTCAATTTCATAGCCCCGGGCGGTTCCCAGCATGGCCCCTTCACAGgta 

aggagcctgagatatggaa 



....1095 nt 

Exon 21 Coordinates: 527dl2_Contig309G 61414-61552 

cttccctgccagGCATCGCATGCGGAAAGTCCATGATGAGCTCCGTGAGCCTGA 
TGGGGGGCCGGGGCGGGGTGCCCCTCTACGACCGGAACCACGTCACAG 

GGGCCTCGTCCAGCAGCTCGTCCAGCACGAAGGCCACGCTGTACCCGCC 

Ggtgaggggcggg 



6513 nt 

Exon 22 Coordinates: 527dl2_Contig309G 68065-68162 

ttggctctcctcagATCCTGAACCCGCCGCCCTCCCCGGCCACGGACCCCTCCCT 

GTACAACATGGACATGTTCTACTCTTCAAACATTCCGGCCACTGCGAGAC 

CGTACAGgtaggacatcccctgcag 



.2273 nt. 



FIG. 3E 
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Exon 23 Coordinates: 527dl2_Contig309G 70435-70901 

tcaaacattccggccactgcgagaccgtacagGCCCTACATCATTCGAGGAATGGCGCCCCC 

GACGACGCCCTGCAGCACCGACGTGTGTGACAGCGACTACAGCGCCAGC 

CGCTGGAAGGCCAGCAAGTACTACCTGGATTTGAACTCGGACTCAGACC 

CCTATCCACCCCCACCCACGCCCCACAGCCAGTACCTGTCGGCGGAGGA 

CAGCTGCCCGCCCTCGCCCGCCACCGAGAGGAGCTACTTCCATCTCTTC 

CCGCCCCCTCCGTCCCCCTGCACGGACTCATCC TGACCTCGGCCGGGGGA 

CTCTGGCTTCTCTGTGCCCCTGTAAATAGTTTTAAATATGAACAAAGAAAAAA 
AT AT ATTTT ATG ATTT AAAAAAT AA AT AT A ATTGGG ATTTT AAA A A C A TG A G A 
AATGTGAACTGTGATGGGGTGGGCAGGGCTGGGAGAACTTTGTACAGTGGAG 
AAATATTTATA AACTTA ATTTTGTA A A A G A 



FIG. 3F 
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SEQUENCE LISTING 



<110> John P. Carulli et al . * 

<120> THE HIGH BONE MASS GENE OF llql3.3 

<130> 032796-021 

<150> US 09/544,398 
<151> 2000-04-05 
<150> US 09/543,771 
<151> 2000-04-05 
<150> US 09/229,319 
<151> 1999-01-13 
<150> US 60/071,449 
<151> 1998-01-13 
<150> US 60/105,511 
<151> 1998-10-23 
<160> 109 

<210> 1 
<211> 5120 
<212> DNA 

<213> Homo sapiens 
<400> 1 

actaaagcgc cgccgccgcg ccatggagcc cgagtgagcg cggcgcgggc ccgtccggcc 60 
gccggacaac atg gag gca gcg ccg ccc ggg ccg ccg tgg ccg ctg ctg 109 



Met Glu Ala Ala Pro Pro Gly Pro Pro Trp Pro Leu Leu 
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15 10 
ctg ctg ctg ctg ctg ctg ctg gcg ctg tgc ggc tgc ccg gcc ccc gcc 157 
Leu Leu Leu Leu Leu Leu Leu Ala Leu Cys Gly Cys Pro Ala Pro Ala 

15 20 25 

gcg gcc teg ccg etc ctg eta ttt gcc aac cgc egg gac gta egg ctg 2 05 

Ala Ala Ser Pro Leu Leu Leu Phe Ala Asn Arg Arg Asp Val Arg Leu 
30 35 40 45 

gtg gac gcc ggc gga gtc aag ctg gag tec ace ate gtg gtc age ggc 253 
Val Asp Ala Gly Gly Val Lys Leu Glu Ser Thr lie Val Val Ser Gly 

50 55 60 

ctg gag gat gcg gcc gca gtg gac ttc cag ttt tec aag gga gcc gtg 3 01 

Leu Glu Asp Ala Ala Ala Val Asp Phe Gin Phe Ser Lys Gly Ala Val 

65 70 75 

tac tgg aca gac gtg age gag gag gcc ate aag cag ace tac ctg aac 34 9 

Tyr Trp Thr Asp Val Ser Glu Glu Ala lie Lys Gin Thr Tyr Leu Asn 

80 85 90 

cag acg ggg gcc gcc gtg cag aac gtg gtc ate tec ggc ctg gtc tct 3 97 

Gin Thr Gly Ala Ala Val Gin Asn Val Val lie Ser Gly Leu Val Ser 

95 100 105 

ccc gac ggc etc gcc tgc gac tgg gtg ggc aag aag ctg tac tgg acg 445 
Pro Asp Gly Leu Ala Cys Asp Trp Val Gly Lys Lys Leu Tyr Trp Thr 
110 115 120 125 

gac tea gag acc aac cgc ate gag gtg gcc aac etc aat ggc aca tec 4 93 

Asp Ser Glu Thr Asn Arg lie Glu Val Ala Asn Leu Asn Gly Thr Ser 

130 135 140 

egg aag gtg etc ttc tgg cag gac ctt gac cag ccg agg gcc ate gcc 541 
Arg Lys Val Leu Phe Trp Gin Asp Leu Asp Gin Pro Arg Ala lie Ala 
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145 150 155 

ttg gac ccc get cac ggg tac atg tac tgg aca gac tgg ggt gag acg 589 
Leu Asp Pro Ala His Gly Tyr Met Tyr Trp Thr Asp Trp Gly Glu Thr 

160 165 170 

ccc egg att gag egg gca ggg atg gat ggc age ace egg aag ate att 637 
Pro Arg He Glu Arg Ala Gly Met Asp Gly Ser Thr Arg Lys He He 

175 180 185 

gtg gac teg gac att tac tgg ccc aat gga ctg ace ate gac ctg gag 685 
Val Asp Ser Asp He Tyr Trp Pro Asn Gly Leu Thr He Asp Leu Glu 
190 195 200 205 

gag cag aag etc tac tgg get gac gee aag etc age ttc ate cac cgt 733 
Glu Gin Lys Leu Tyr Trp Ala Asp Ala Lys Leu Ser Phe He His Arg 

210 215 220 

gee aac ctg gac ggc teg ttc egg cag aag gtg gtg gag ggc age ctg 7 81 

Ala Asn Leu Asp Gly Ser Phe Arg Gin Lys Val Val Glu Gly Ser Leu 

225 230 235 

acg cac ccc ttc gee ctg acg etc tec ggg gac act ctg tac tgg aca 829 
Thr His Pro Phe Ala Leu Thr Leu Ser Gly Asp Thr Leu Tyr Trp Thr 

240 245 250 

gac tgg cag acc cgc tec ate cat gee tgc aac aag cgc act ggg ggg 877 
Asp Trp Gin Thr Arg Ser He His Ala Cys Asn Lys Arg Thr Gly Gly 

255 260 265 

aag agg aag gag ate ctg agt gee etc tac tea ccc atg gac ate cag 925 
Lys Arg Lys Glu He Leu Ser Ala Leu Tyr Ser Pro Met Asp He Gin 
270 275 280 285 

gtg ctg age cag gag egg cag cct ttc ttc cac act cgc tgt gag gag 973 
Val Leu Ser Gin Glu Arg Gin Pro Phe Phe His Thr Arg Cys Glu G lu 
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290 295 300 

gac aat ggc ggc tgc tec cac ctg tgc ctg ctg tec cca age gag cct 1021 

Asp Asn Gly Gly Cys Ser His Leu Cys Leu Leu Ser Pro Ser Glu Pro 

305 310 315 

ttc tac aca tgc gec tgc ccc acg ggt gtg cag ctg cag gac aac ggc 10 69 

Phe Tyr Thr Cys Ala Cys Pro Thr Gly Val Gin Leu Gin Asp Asn Gly 

320 325 330 

agg acg tgt aag gca gga gec gag gag gtg ctg ctg ctg gec egg egg 1117 

Arg Thr Cys Lys Ala Gly Ala Glu Glu Val Leu Leu Leu Ala Arg Arg 

335 340 345 

acg gac eta egg agg ate teg ctg gac acg ccg gac ttc ace gac ate 1165 

Thr Asp Leu Arg Arg lie Ser Leu Asp Thr Pro Asp Phe Thr Asp lie 

350' 355 360 365 

gtg ctg cag gtg gac gac ate egg cac gee att gee ate gac tac gac 1213 

Val Leu Gin Val Asp Asp lie Arg His Ala lie Ala lie Asp Tyr Asp 

370 375 380 

ccg eta gag ggc tat gtc tac tgg aca gat gac gag gtg egg gec ate 1261 

Pro Leu Glu Gly Tyr Val Tyr Trp Thr Asp Asp Glu Val Arg Ala lie 

385 390 395 

cgc agg gcg tac ctg gac ggg tct ggg gcg cag acg ctg gtc aac ace 13 09 

Arg Arg Ala Tyr Leu Asp Gly Ser Gly Ala Gin Thr Leu Val Asn Thr 

400 405 410 

gag ate aac gac ccc gat ggc ate gcg gtc gac tgg gtg gee cga aac 1357 

Glu He Asn Asp Pro Asp Gly He Ala Val Asp Trp Val Ala Arg Asn 

415 420 425 

etc tac tgg ace gac acg ggc acg gac cgc ate gag gtg acg cgc etc 1405 

Leu Tyr Trp Thr Asp Thr Gly Thr Asp Arg He Glu Val Thr Arg Leu 
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430 435 440 445 

aac ggc acc tec cgc aag ate ctg gtg teg gag gac ctg gac gag ccc 14 53 

Asn Gly Thr Ser Arg Lys He Leu Val Ser Glu Asp Leu Asp Glu Pro 

450 455 460 

cga gec ate gca ctg cac ccc gtg atg ggc etc atg tac tgg aca gac 15 01 

Arg Ala He Ala Leu His Pro Val Met Gly Leu Met Tyr Trp Thr Asp 

465 470 475 

tgg gga gag aac cct aaa ate gag tgt gee aac ttg gat ggg cag gag 1549 
Trp Gly Glu Asn Pro Lys He Glu Cys Ala Asn Leu Asp Gly Gin Glu 

480 485 490 

c 99 c 9t 9 fc 9 ct 9 gtc aat gee tec etc ggg tgg ccc aac ggc ctg gec 1597 
Arg Arg Val Leu Val Asn Ala Ser Leu Gly Trp Pro Asn Gly Leu Ala 

495 500 505 

ctg gac ctg cag gag ggg aag etc tac tgg gga gac gec aag aca gac 1645 
Leu Asp Leu Gin Glu Gly Lys Leu Tyr Trp Gly Asp Ala Lys Thr Asp 
510 515 520 525 

aag ate gag gtg ate aat gtt gat ggg acg aag agg egg acc etc ctg 1693 . 

Lys He Glu Val He Asn Val Asp Gly Thr Lys Arg Arg Thr Leu Leu 

530 535 540 

gag gac aag etc ccg cac att ttc ggg ttc acg ctg ctg ggg gac ttc 1741 
Glu Asp Lys Leu Pro His He Phe Gly Phe Thr Leu Leu Gly Asp Phe 

545 550 555 

ate tac tgg act gac tgg cag cgc cgc age ate gag egg gtg cac aag 1789 
He Tyr Trp Thr Asp Trp Gin Arg Arg Ser He Glu Arg Val His Lys 

560 565 570 

gtc aag gee age egg gac gtc ate att gac cag ctg ccc gac ctg atg 1837 
Val Lys Ala Ser Arg Asp Val He He Asp Gin Leu Pro Asp Leu Met 
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575 580 585 

ggg etc aaa get gtg aat gtg gec aag gtc gtc gga acc aac ccg tgt 1885 

Gly Leu Lys Ala Val Asn Val Ala Lys Val Val Gly Thr Asn Pro Cys 

590 595 600 605 

gcg gac agg aac ggg ggg tgc age cac ctg tgc ttc ttc aca ccc cac 1933 

Ala Asp Arg Asn Gly Gly Cys Ser His Leu Cys Phe Phe Thr Pro His 

610 615 620 

gca acc egg tgt ggc tgc ccc ate ggc ctg gag ctg ctg agt gac atg 1981 

Ala Thr Arg Cys Gly Cys Pro lie Gly Leu Glu Leu Leu Ser Asp Met 

625 630 635 

aag acc tgc ate gtg cct gag gee ttc ttg gtc ttc acc age aga gec 2029 

Lys Thr Cys lie Val Pro Glu Ala Phe Leu Val Phe Thr Ser Arg Ala 

640 645 650 

gec ate cac agg ate tec etc gag acc aat aac aac gac gtg gee ate 2 077 

Ala lie His Arg lie Ser Leu Glu Thr Asn Asn Asn Asp Val Ala lie 

655 660 665 

ccg etc acg ggc gtc aag gag gee tea gec ctg gac ttt gat gtg tec 2125 

Pro Leu Thr Gly Val Lys Glu Ala Ser Ala Leu Asp Phe Asp Val Ser 

670 675 680 685 

aac aac cac ate tac tgg aca gac gtc age ctg aag acc ate age cgc 2173 

Asn Asn His lie Tyr Trp Thr Asp Val Ser Leu Lys Thr lie Ser Arg 

690 695 700 

gec ttc atg aac ggg age teg gtg gag cac gtg gtg gag ttt ggc ctt 2221 

Ala Phe Met Asn Gly Ser Ser Val Glu His Val Val Glu Phe Gly Leu 

705 710 715 

gac tac ccc gag ggc atg gee gtt gac tgg atg ggc aag aac etc tac 22 69 

Asp Tyr Pro Glu Gly Met Ala Val Asp Trp Met Gly Lys Asn Leu Tyr 
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720 725 730 

tgg gcc gac act ggg acc aac aga ate gaa gtg gcg egg ctg gac ggg 2 317 

Trp Ala Asp Thr Gly Thr Asn Arg lie Glu Val Ala Arg Leu Asp Gly 

735 740 745 

cag ttc egg caa gtc etc gtg tgg agg gac ttg gac aac ccg agg teg 2365 
Gin Phe Arg Gin Val Leu Val Trp Arg Asp Leu Asp Asn Pro Arg Ser 
750 755 760 765 

ct g g cc ctg gat ccc acc aag ggc tac ate tac tgg acc gag tgg ggc 2413 
Leu Ala Leu Asp Pro Thr Lys Gly Tyr lie Tyr Trp Thr Glu Trp Gly 

770 775 780 

ggc aag ccg agg ate gtg egg gcc ttc atg gac ggg acc aac tgc atg 2461 
Gly Lys Pro Arg He Val Arg Ala Phe Met Asp Gly Thr Asn Cys Met 

785 790 795 

acg ctg gtg gac aag gtg ggc egg gcc aac gac etc acc att gac tac 2509 
Thr Leu Val Asp Lys Val Gly Arg Ala Asn Asp Leu Thr He Asp Tyr 

800 805 810 

get gac cag cgc etc tac tgg acc gac ctg gac acc aac atg ate gag 2~557 
Ala Asp Gin Arg Leu Tyr Trp Thr Asp Leu Asp Thr Asn Met He Glu 

815 820 825 

teg tec aac atg ctg ggt cag gag egg gtc gtg att gcc gac gat etc 2605 
Ser Ser Asn Met Leu Gly Gin Glu Arg Val Val He Ala Asp Asp Leu 
830 835 840 845 

ccg cac ccg ttc ggt ctg acg cag tac age gat tat ate tac tgg aca 2653 
Pro His Pro Phe Gly Leu Thr Gin Tyr Ser Asp Tyr He Tyr Trp Thr 

850 855 860 

gac tgg aat ctg cac age att gag egg gcc gac aag act age ggc egg 2701 
Asp Trp Asn Leu His Ser He Glu Arg Ala Asp Lys Thr Ser Gly Arg 
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865 870 875 

aac cgc acc etc ate cag ggc cac ctg gac ttc gtg atg gac ate ctg 2749 
Asn Arg Thr Leu He Gin Gly His Leu Asp Phe Val Met Asp He Leu 

880 885 890 

gtg ttc cac tec tec cgc cag gat ggc etc aat gac tgt atg cac aac 2797 
Val Phe His Ser Ser Arg Gin Asp Gly Leu Asn Asp Cys Met His Asn 

895 900 905 

aac ggg cag tgt ggg cag ctg tgc ctt gee ate ccc ggc ggc cac cgc 2 845 

Asn Gly Gin Cys Gly Gin Leu Cys Leu Ala He Pro Gly Gly His Arg 
910 915 920 925 

t9 c S9 C tgc gec tea cac tac acc ctg gac ccc age age cgc aac tgc 2 893 

Cys Gly Cys Ala Ser His Tyr Thr Leu Asp Pro Ser Ser Arg Asn Cys 

930 935 940 

age ccg ccc acc acc ttc ttg ctg ttc age cag aaa tct gec ate agt 2941 
Ser Pro Pro Thr Thr Phe Leu Leu Phe Ser Gin Lys Ser Ala He Ser 

945 950 955 

egg atg ate ccg gac gac cag cac age ccg gat etc ate ctg ccc ctg 2 989 

Arg Met He Pro Asp Asp Gin His Ser Pro Asp Leu He Leu Pro Leu 

960 965 970 

cat gga ctg agg aac gtc aaa gec ate gac tat gac cca ctg gac aag 3037 
His Gly Leu Arg Asn Val Lys Ala He Asp Tyr Asp Pro Leu Asp Lys 

975 980 985 

ttc ate tac tgg gtg gat ggg cgc cag aac ate aag cga gee aag gac 3 085 

Phe He Tyr Trp Val Asp Gly Arg Gin Asn He Lys Arg Ala Lys Asp 
990 995 1000 1005 

gac ggg acc cag ccc ttt gtt ttg acc tct ctg age caa ggc caa aac 313 3 

Asp Gly Thr Gin Pro Phe Val Leu Thr Ser Leu Ser Gin Gly Gin Asn 
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1010 1015 1020 

cca gac agg cag ccc cac gac etc age ate gac ate tac age egg aca 3181 
Pro Asp Arg Gin Pro His Asp Leu Ser lie Asp lie Tyr Ser Arg Thr 

1025 1030 1035 

ctg ttc tgg acg tgc gag gec ace aat ace ate aac gtc cac agg ctg 3229 
Leu Phe Trp Thr Cys Glu Ala Thr Asn Thr lie Asn Val His Arg Leu 

1040 1045 1050 

age ggg gaa gec atg ggg gtg gtg ctg cgt ggg gac cgc gac aag ccc 3 2 77 

Ser Gly Glu Ala Met Gly Val Val Leu Arg Gly Asp Arg Asp Lys Pro 

1055 1060 1065 

agg gec ate gtc gtc aac gcg gag cga ggg tac ctg tac ttc acc aac 3325 
Arg Ala lie Val Val Asn Ala Glu Arg Gly Tyr Leu Tyr Phe Thr Asn 
1070 1075 1080 1085 

atg cag gac egg gca gec aag ate gaa cgc gca gec ctg gac ggc acc 3373 
Met Gin Asp Arg Ala Ala Lys lie Glu Arg Ala Ala Leu Asp Gly Thr 

1090 1095 1100 

gag cgc gag gtc etc ttc acc acc ggc etc ate cgc cct gtg gee ctg 3421 
Glu Arg Glu Val Leu Phe Thr Thr Gly Leu lie Arg Pro Val Ala Leu 

1105 1110 1115 

gtg gtg gac aac aca ctg ggc aag ctg ttc tgg gtg gac gcg gac ctg 3469 
Val Val Asp Asn Thr Leu Gly Lys Leu Phe Trp Val Asp Ala Asp Leu 

1120 1125 1130 

aag cgc att gag age tgt gac ctg tea ggg gec aac cgc ctg acc ctg ■ 3517 

Lys Arg lie Glu Ser Cys Asp Leu Ser Gly Ala Asn Arg Leu Thr Leu 

1135 1140 1145 

gag gac gee aac ate gtg cag cct ctg ggc ctg acc ate ctt ggc aag 35 65 

Glu Asp Ala Asn lie Val Gin Pro Leu Gly Leu Thr lie Leu Gly Lys 
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1150 1155 1160 1165 

cat etc tac tgg ate gac cgc cag cag cag atg ate gag cgt gtg gag 3 613 

His Leu Tyr Trp lie Asp Arg Gin Gin Gin Met lie Glu Arg Val Glu 

1170 1175 1180 

aag acc acc ggg gac aag egg act cgc ate cag ggc cgt gtc gec cac 3 661 

Lys Thr Thr Gly Asp Lys Arg Thr Arg He Gin Gly Arg Val Ala His 

1185 1190 1195 

etc act ggc ate cat gca gtg gag gaa gtc age ctg gag gag ttc tea 3 709 

Leu Thr Gly He His Ala Val Glu Glu Val Ser Leu Glu Glu Phe Ser 

1200 1205 1210 

gee cac cca tgt gee cgt gac aat ggt ggc tgc tec cac ate tgt att 3757 
Ala His Pro Cys Ala Arg Asp Asn Gly Gly Cys Ser His He Cys He 

1215 1220 1225 

gee aag ggt gat ggg aca cca egg tgc tea tgc cca gtc cac etc gtg 3805 
Ala Lys Gly Asp Gly Thr Pro Arg Cys Ser Cys Pro Val His Leu Val 
1230 1235 1240 1245 

etc ctg cag aac ctg ctg acc tgt gga gag ccg cec acc tgc tec ccg 3 853 

Leu Leu Gin Asn Leu Leu Thr Cys Gly Glu Pro Pro Thr Cys Ser Pro 

1250 1255 1260 

gac cag ttt gca tgt gee aca ggg gag ate gac tgt ate cec ggg gee 3 901 

Asp Gin Phe Ala Cys Ala Thr Gly Glu He Asp Cys He Pro Gly Ala 

1265 1270 1275 

tgg cgc tgt gac ggc ttt cec gag tgc gat gac cag age gac gag gag 3 949 

Trp Arg Cys Asp Gly Phe Pro Glu Cys Asp Asp Gin Ser Asp Glu Glu 

1280 1285 1290 

ggc tgc cec gtg tgc tec gee gec cag ttc cec tgc gcg egg ggt cag 3 997 

Gly Cys Pro Val Cys Ser Ala Ala Gin Phe Pro Cys Ala Arg Gly Gin 
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1295 1300 1305 

tgt gtg gac ctg cgc ctg cgc tgc gap ggc gag gca gac tgt cag gac 4045 
Cys Val Asp Leu Arg Leu Arg Cys Asp Gly Glu Ala Asp Cys Gin Asp 
1310 1315 1320 1325 

cgc tea gac gag gtg gac tgt gac gec ate tgc ctg ccc aac cag ttc 4093 
Arg Ser Asp Glu Val Asp Cys Asp Ala lie Cys Leu Pro Asn Gin Phe 

1330 1335 1340 

egg tgt gcg age ggc cag tgt gtc etc ate aaa cag cag tgc gac tec 4141 
Arg Cys Ala Ser Gly Gin Cys Val Leu lie Lys Gin Gin Cys Asp Ser 

1345 1350 1355 

ttc ccc gac tgt ate gac ggc tec gac gag etc atg tgt gaa ate ace 4189 
Phe Pro Asp Cys lie Asp Gly Ser Asp Glu Leu Met Cys Glu lie Thr 

1360 1365 1370 

aag ccg ccc tea gac gac age ccg gee cac age agt gec ate ggg ccc 4237 
Lys Pro Pro Ser Asp Asp Ser Pro Ala His Ser Ser Ala lie Gly Pro 

1375 1380 1385 

gtc att ggc ate ate etc tct etc ttc gtc atg ggt ggt gtc tat ttt 42 85 

Val lie Gly lie lie Leu Ser Leu Phe Val Met Gly Gly Val Tyr Phe 
1390 1395 1400 1405 

gtg tgc cag cgc gtg gtg tgc cag cgc tat gcg ggg gec aac ggg ccc 43 33 

Val Cys Gin Arg Val Val Cys Gin Arg Tyr Ala Gly Ala Asn Gly Pro 

1410 1415 1420 

ttc ccg cac gag tat gtc age ggg ace ccg cac gtg ccc etc aat ttc 43 81 

Phe Pro His Glu Tyr Val Ser Gly Thr Pro His Val Pro Leu Asn Phe 

1425 1430 1435 

ata gee ccg ggc ggt tec cag cat ggc ccc ttc aca ggc ate gca tgc 4429 
lie Ala Pro Gly Gly Ser Gin His Gly Pro Phe Thr Gly lie Ala Cys 
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1440 1445 1450 

gga aag tec atg atg age tec gtg age ctg atg ggg ggc egg ggc ggg 4477 
Gly Lys Ser Met Met Ser Ser Val Ser Leu Met Gly Gly Arg Gly Gly 

1455 1460 1465 

gtg ccc etc tac gac egg aac cac gtc aea ggg gee teg tec age age 4525 
Val Pro Leu Tyr Asp Arg Asn His Val Thr Gly Ala Ser Ser Ser Ser 
1470 1475 1480 1485 

teg tec age acg aag gee acg ctg tac ccg ccg ate ctg aac ccg ccg 4573 
Ser Ser Ser Thr Lys Ala Thr Leu Tyr Pro Pro He Leu Asn Pro Pro 

1490 1495 1500 

ccc tec ccg gec acg gac ccc tec ctg tac aac atg gac atg ttc tac 4621 
Pro Ser Pro Ala Thr Asp Pro Ser Leu Tyr Asn Met Asp Met Phe Tyr 

1505 1510 1515 

tct tea aac att ccg gee act gcg aga ccg tac agg ccc tac ate att 4669 
Ser Ser Asn He Pro Ala Thr Ala Arg Pro Tyr Arg Pro Tyr lie He 

1520 1525 1530 

c ga gga atg gcg ccc ccg acg acg ccc tgc age ace gac gtg tgt gac 4717 
Arg Gly Met Ala Pro Pro Thr Thr Pro Cys Ser Thr Asp Val Cys Asp 

1535 1540 1545 

age gac tac age gee age cgc tgg aag gee age aag tac tac ctg gat 4 7 65 

Ser Asp Tyr Ser Ala Ser Arg Trp Lys Ala Ser Lys Tyr Tyr Leu Asp 
1550 1555 1560 1565 

ttg aac teg gac tea gac ccc tat cca ccc cca ccc acg ccc cac age 4813 
Leu Asn Ser Asp Ser Asp Pro Tyr Pro Pro Pro Pro Thr Pro His Ser 

1570 1575 1580 

cag tac ctg teg gcg gag gac age tgc ccg ccc teg ccc gee ace gag 4 861 

Gin Tyr Leu Ser Ala Glu Asp Ser Cys Pro Pro Ser Pro Ala Thr Glu 
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1585 1590 1595 

agg age tac ttc cat etc ttc ccg ccc cct ccg tec ccc tgc acg gac 4909 
Arg Ser Tyr Phe His Leu Phe Pro Pro Pro Pro Ser Pro Cys Thr Asp 

1600 1605 1610 

tea tec tgacctcggc cgggccactc tggcttctct gtgcccctgt aaatagtttt 4965 
Ser Ser 
1615 

aaatatgaac aaagaaaaaa atatatttta tgatttaaaa aataaatata attgggattt 5025 
taaaaacatg agaaatgtga actgtgatgg ggtgggcagg gctgggagaa ctttgtacag 5085 
tggagaaata tttataaact taattttgta aaaca 5120 



<210> 2 

<211> 5120 

<212> DNA 

<213> Homo sapiens 



<400> 2 

actaaagege cgccgccgcg ccatggagcc cgagtgagcg cggcgcgggc ccgtccggcc 60 
gccggacaac atg gag gca gcg ccg ccc ggg ccg ccg tgg ccg ctg ctg 10 9 

Met Glu Ala Ala Pro Pro Gly Pro Pro Trp Pro Leu Leu 
15 10 
ctg *ctg ctg ctg ctg ctg ctg gcg ctg tgc ggc tgc ccg gec ccc gec 157 
Leu Leu Leu Leu Leu Leu Leu Ala Leu Cys Gly Cys Pro Ala Pro Ala 

15 20 25 

gcg gec teg ccg etc ctg eta ttt gec aac cgc egg gac gta egg ctg 2 05 

Ala Ala Ser Pro Leu Leu Leu Phe Ala Asn Arg Arg Asp Val Arg Leu 
30 35 40 45 



BNSDOCID: <WO 0177327A1_I_> 




WO 01/77327 

gtg gac gcc ggc gga gtc 

Val Asp Ala Gly Gly Val 
50 

c tg gag gat gcg gcc gca 

Leu Glu Asp Ala Ala Ala 
65 

tac tgg aca gac gtg age 

Tyr Trp Thr Asp Val Ser 
80 

cag acg ggg gcc gcc gtg 

Gin Thr Gly Ala Ala Val 
95 

ccc gac ggc etc gcc tgc 

Pro Asp Gly Leu Ala Cys 

110 115 

gac tea gag acc aac cgc 

Asp Ser Glu Thr Asn Arg 
130 

egg aag gtg etc ttc tgg 

Arg Lys Val Leu Phe Trp 
145 

ttg gac ccc get cac ggg 

Leu Asp Pro Ala His Gly 
160 

ccc egg att gag egg gca 

Pro Arg lie Glu Arg Ala 
175 



14 

aag ctg gag tec acc ate 
Lys Leu Glu Ser Thr lie 
55 

gtg gac ttc cag ttt tec 
Val Asp Phe Gin Phe Ser 
70 

gag gag gcc ate aag cag 
Glu Glu Ala lie Lys Gin 
85 

cag aac gtg gtc ate tec 
Gin Asn Val Val lie Ser 
100 105 
gac tgg gtg ggc aag aag 
Asp Trp Val Gly Lys Lys 
120 

ate gag gtg gcc aac etc 
lie Glu Val Ala Asn Leu 
135 

cag gac ctt gac cag ccg 
Gin Asp Leu Asp Gin Pro 
150 

tac atg tac tgg aca gac 
Tyr Met Tyr Trp Thr Asp 
165 

ggg atg gat ggc age acc 
Gly Met Asp Gly Ser Thr 
180 185 




PCT/US00/16951 

9tg gtc age ggc 253 
Val Val Ser Gly 
60 

aag gga gcc gtg 301 
Lys Gly Ala Val 
75 

acc tac ctg aac 349 

Thr Tyr Leu Asn 

90 

ggc ctg gtc tct 397 
Gly Leu Val Ser 

ctg tac tgg acg 445 
Leu Tyr Trp Thr 
125 

aat ggc aca tec 493 
Asn Gly Thr Ser 
140 

agg gcc ate gcc 541 
Arg Ala lie Ala 
155 

tgg gtt gag acg 589 

Trp Val Glu Thr 

170 

egg aag ate att 63 7 

Arg Lys lie lie 
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gtg gac teg gac att tac tgg ccc aat gga ctg acc ate gac ctg gag 685 

Val Asp Ser Asp lie Tyr Trp Pro Asn Gly Leu Thr lie Asp Leu Glu 

190 195 200 205 

gag cag aag etc tac tgg get gac gee aag etc age ttc ate cac cgt 73 3 

Glu Gin Lys Leu Tyr Trp Ala Asp Ala Lys Leu Ser Phe lie His Arg 

210 215 220 

gec aac ctg gac ggc teg ttc egg cag aag gtg gtg gag ggc age ctg 781 

Ala Asn Leu Asp Gly Ser Phe Arg Gin Lys Val Val Glu Gly Ser Leu 

225 230 235 

acg cac ccc ttc gee ctg acg etc tec ggg gac act ctg tac tgg aca 829 

Thr His Pro Phe Ala Leu Thr Leu Ser Gly Asp Thr Leu Tyr Trp Thr 

240 245 250 

gac tgg cag acc cgc tec ate cat gee tgc aac aag cgc act ggg ggg 877 

Asp Trp Gin Thr Arg Ser lie His Ala Cys Asn Lys Arg Thr Gly Gly 

255 260 265 

aag agg aag gag ate ctg agt gee etc tac tea ccc atg gac ate cag 92 5 

Lys Arg Lys Glu lie Leu Ser Ala Leu Tyr Ser Pro Met Asp lie Gin 
270 275 280 285 

gtg ctg age cag gag egg cag cct ttc ttc cac act cgc tgt gag gag 973 

Val Leu Ser Gin Glu Arg Gin Pro Phe Phe His Thr Arg Cys Glu Glu 

290 295 300 

gac aat ggc ggc tgc tec cac ctg tgc ctg ctg tec cca age gag cct 1021 

Asp Asn Gly Gly Cys Ser His Leu Cys Leu Leu Ser Pro Ser Glu Pro 

305 310 315 

ttc tac aca tgc gee tgc ccc acg ggt gtg cag ctg cag gac aac ggc 1069 

Phe Tyr Thr Cys Ala Cys Pro Thr Gly Val Gin Leu Gin Asp Asn Gly 

320 325 330 
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agg acg tgt aag gca 
Arg Thr Cys Lys Ala 
335 

acg gac eta egg agg 
Thr Asp Leu Arg Arg 
350 

gtg ctg cag gtg gac 
Val Leu Gin Val Asp 
370 

ccg eta gag ggc tat 
Pro Leu Glu Gly Tyr 
385 

cgc agg gcg tac ctg 
Arg Arg Ala Tyr Leu 
400 

gag ate aac gac ccc 
Glu lie Asn Asp Pro 
415 

etc tac tgg ace gac 
Leu Tyr Trp Thr Asp 
430 

aac ggc acc tec cgc 
Asn Gly Thr Ser Arg 
450 

cga gee ate gca ctg 
Arg Ala lie Ala Leu 
465 
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gga gec gag gag gtg 
Gly Ala Glu Glu Val 
340 

ate teg ctg gac acg 
lie Ser Leu Asp Thr 
355 

gac ate egg cac gec 
Asp He Arg His Ala 
375 

gtc tac tgg aca gat 
Val Tyr Trp Thr Asp 
390 

gac ggg tct ggg gcg 
Asp Gly Ser Gly Ala 
405 

gat ggc ate gcg gtc 
Asp Gly He Ala Val 
420 

acg ggc acg gac cgc 
Thr Gly Thr Asp Arg 
435 

aag ate ctg gtg teg 
Lys He Leu Val Ser 
455 

cac ccc gtg atg ggc 
His Pro Val Met Gly 
470 




ctg ctg ctg gee 

Leu Leu Leu Ala 
345 

ccg gac ttc acc 

Pro Asp Phe Thr 
360 

att gec ate gac 

He Ala He Asp 

gac gag gtg egg 
Asp Glu Val Arg 
395 

cag acg ctg gtc 
Gin Thr Leu Val 
410 

gac tgg gtg gee 
Asp Trp Val Ala 
425 

ate gag gtg acg 
He Glu Val Thr 
440 

gag gac ctg gac 
Glu Asp Leu Asp 

etc atg tac tgg 
Leu Met Tyr Trp 
475 
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egg egg 1117 
Arg Arg 

gac ate 1165 
Asp He 
365 

tac gac 1213 

Tyr Asp 

380 

gee ate 1261 
Ala He 

aac acc 1309 
Asn Thr 

cga aac 1357 
Arg Asn 

cgc etc 14 05 

Arg Leu 
445 

gag ccc 1453 

Glu Pro 

460 

aca gac 1501 
Thr Asp 
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tgg 99 a 9 a 9 aac cct aaa atc 9*9 tgt 9 c c aac ttg gat ggg cag gag 1549 

Trp Gly Glu Asn Pro Lys lie Glu Cys.. Ala Asn Leu Asp Gly Gin Glu 

480 485 490 

egg cgt gtg ctg gtc aat gec tec etc ggg tgg ccc aac ggc ctg gec 15 97 

Arg Arg Val Leu Val Asn Ala Ser Leu Gly Trp Pro Asn Gly Leu Ala 

495 500 505 

ctg gac ctg cag gag ggg aag etc tac tgg gga gac gec aag aca gac 1645 

Leu Asp Leu Gin Glu Gly Lys Leu Tyr Trp Gly Asp Ala Lys Thr Asp 

510 515 520 525 

aag atc gag gtg atc aat gtt gat ggg acg aag agg egg ace etc ctg 1693 

Lys lie Glu Val lie Asn Val Asp Gly Thr Lys Arg Arg Thr Leu Leu 

530 535 540 

gag gac aag etc ccg cac att ttc ggg ttc acg ctg ctg ggg gac ttc 1741 

Glu Asp Lys Leu Pro His lie Phe Gly Phe Thr Leu Leu Gly Asp Phe 

545 550 555 

atc tac tgg act gac tgg cag cgc cgc age atc gag egg gtg cac aag 1789 

lie Tyr Trp Thr Asp Trp Gin Arg Arg Ser He Glu Arg Val His Lys 

560 565 570 

gtc aag gee age egg gac gtc atc att gac cag ctg ccc gac ctg atg 1837 

Val Lys Ala Ser Arg Asp Val He He Asp Gin Leu Pro Asp Leu Met 

575 580 585 

ggg etc aaa get gtg aat gtg gec aag gtc gtc gga ace aac ccg tgt 1885 

Gly Leu Lys Ala Val Asn Val Ala Lys Val Val Gly Thr Asn Pro Cys 

590 595 600 605 

gcg gac agg aac ggg ggg tgc age cac ctg tgc ttc ttc aca ccc cac 193 3 

Ala Asp Arg Asn Gly Gly Cys Ser His Leu Cys Phe Phe Thr Pro His 
610 615 620 
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gca acc egg tgt ggc tgc 
Ala Thr Arg Cys Gly Cys 
625 

aag acc tgc ate gtg cct 
Lys Thr Cys He Val Pro 
640 

gec ate cac agg ate tec 
Ala He His Arg He Ser 
655 

ccg etc acg ggc gtc aag 
Pro Leu Thr Gly Val Lys 
670 675 
aac aac cac ate tac tgg 
Asn Asn His He Tyr Trp 
690 

gec ttc atg aac ggg age 
Ala Phe Met Asn Gly Ser 
705 

gac tac ccc gag ggc atg 
Asp Tyr Pro Glu Gly Met 
720 

tgg gee gac act ggg acc 
Trp Ala Asp Thr Gly Thr 
735 

cag ttc egg caa gtc etc 
Gin Phe Arg Gin Val Leu 
750 755 



ccc ate ggc ctg gag ctg 
Pro He Gly Leu Glu Leu 
630 

gag gee ttc ttg gtc ttc 
Glu Ala Phe Leu Val Phe 
645 

etc gag acc aat aac aac 
Leu Glu Thr Asn Asn Asn 
660 665 
gag gec tea gec ctg gac 
Glu Ala Ser Ala Leu Asp 
680 

aca gac gtc age ctg aag 
Thr Asp Val Ser Leu Lys 
695 

teg gtg gag cac gtg gtg 
Ser Val Glu His Val Val 
710 

gec gtt gac tgg atg ggc 
Ala Val Asp Trp Met Gly 
725 

aac aga ate gaa gtg gcg 
Asn Arg He Glu Val Ala 
740 745 
gtg tgg agg gac ttg gac 
Val Trp Arg Asp Leu Asp 
760 



ctg agt gac atg 1981 
Leu Ser Asp Met 
635 

acc age aga gec 2 029 

Thr Ser Arg Ala 

650 

gac gtg gee ate 2 077 

Asp Val Ala He 

ttt gat gtg tec 2125 
Phe Asp Val Ser 
685 

acc ate age cgc 2173 
Thr He Ser Arg 
700 

gag ttt ggc ctt 2221 
Glu Phe Gly Leu 
715 

aag aac etc tac 2269 

Lys Asn Leu Tyr 

730 

egg ctg gac ggg 2317 
Arg Leu Asp Gly 

aac ccg agg teg 2365 
Asn Pro Arg Ser 
765 
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ctg gcc ctg gat ccc acc aag ggc tac ate tac tgg acc gag tgg ggc 2413 
Leu Ala Leu Asp Pro Thr Lys Gly Tyr lie Tyr Trp Thr Glu Trp Gly 

770 775 780 

ggc aag ccg agg ate gtg egg gcc ttc atg gac ggg acc aac tgc atg 24 61 

Gly Lys Pro Arg lie Val Arg Ala Phe Met Asp Gly Thr Asn Cys Met 

785 790 795 

acg ctg gtg gac aag gtg ggc egg gcc aac gac etc acc att gac tac 2509 
Thr Leu Val Asp Lys Val Gly Arg Ala Asn Asp Leu Thr lie Asp Tyr 

800 805 810 

get gac cag cgc etc tac tgg acc gac ctg gac acc aac atg ate gag 2557 
Ala Asp Gin Arg Leu Tyr Trp Thr Asp Leu Asp Thr Asn Met lie Glu 

815 820 825 

teg tec aac atg ctg ggt cag gag egg gtc gtg att gcc gac gat etc 2605 
Ser Ser Asn Met Leu Gly Gin Glu Arg Val Val lie Ala Asp Asp Leu 
830 835 840 845 

ccg cac ccg ttc ggt ctg acg cag tac age gat tat ate tac tgg aca 2 653 

Pro His Pro Phe Gly Leu Thr Gin Tyr Ser Asp Tyr lie Tyr Trp Thr 

850 855 860 

gac tgg aat ctg cac age att gag egg gcc gac aag act age ggc egg 2701 
Asp Trp Asn Leu His Ser lie Glu Arg Ala Asp Lys Thr Ser Gly Arg 

865 870 875 

aac cgc acc etc ate cag ggc cac ctg gac ttc gtg atg gac ate ctg 2749 
Asn Arg Thr Leu lie Gin Gly His Leu Asp Phe Val Met Asp lie Leu 

880 885 890 

gtg ttc cac tec tec cgc cag gat ggc etc aat gac tgt atg cac aac 2797 
Val Phe His Ser Ser Arg Gin Asp Gly Leu Asn Asp Cys Met His Asn 
895 900 905 
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aac ggg cag tgt ggg cag ctg tgc ctt gcc ate ccc ggc ggc cac cgc 2 845 

Asn Gly Gin Cys Gly Gin Leu Cys Leu Ala lie Pro Gly Gly His Arg 

910 915 920 925 

tgc ggc tgc gcc tea cac tac acc ctg gac ccc age age cgc aac tgc 2893 

Cys Gly Cys Ala Ser His Tyr Thr Leu Asp Pro Ser Ser Arg Asn Cys 

930 935 940 

age ccg ccc acc acc ttc ttg ctg ttc age cag aaa tct gcc ate agt 2 941 

Ser Pro Pro Thr Thr Phe Leu Leu Phe Ser Gin Lys Ser Ala lie Ser 

945 950 955 

egg atg ate ccg gac gac cag cac age ccg gat etc ate ctg ccc ctg 2 98 9 

Arg Met He Pro Asp Asp Gin His Ser Pro Asp Leu He Leu Pro Leu 

960 965 970 

cat gga ctg agg aac gtc aaa gcc ate gac tat gac cca ctg gac aag 3 03 7 

His Gly Leu Arg Asn Val Lys Ala He Asp Tyr Asp Pro Leu Asp Lys 

975 980 985 

ttc ate tac tgg gtg gat ggg cgc cag aac ate aag cga gcc aag gac 3 085 

Phe He Tyr Trp Val Asp Gly Arg Gin Asn He Lys Arg Ala Lys Asp 
990 995 1000 1005 

gac ggg acc cag ccc ttt gtt ttg acc tct ctg age caa ggc caa aac 3133 

Asp Gly Thr Gin Pro Phe Val Leu Thr Ser Leu Ser Gin Gly Gin Asn 

1010 1015 1020 

cca gac agg cag ccc cac gac etc age ate gac ate tac age egg aca 3181 

Pro Asp Arg Gin Pro His Asp Leu Ser He Asp He Tyr Ser Arg Thr 

1025 1030 1035 

ctg ttc tgg acg tgc gag gcc acc aat acc ate aac gtc cac agg ctg 322 9 

Leu Phe Trp Thr Cys Glu Ala Thr Asn Thr He Asn Val His Arg Leu 

1040 1045 1050 
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age ggg gaa gec atg ggg gtg gtg ctg cgt ggg gac cgc gac aag ccc 3 27 7 

Ser Gly Glu Ala Met Gly Val Val Leu Arg Gly Asp Arg Asp Lys Pro 

1055 1060 1065 

agg gec ate gtc gtc aac gcg gag cga ggg tac ctg tac ttc acc aac 3 325 

Arg Ala lie Val Val Asn Ala Glu Arg Gly Tyr Leu Tyr Phe Thr Asn 
1070 1075 1080 1085 

atg cag gac egg gca gec aag ate gaa cgc gca gec ctg gac ggc acc 3373 

Met Gin Asp Arg Ala Ala Lys lie Glu Arg Ala Ala Leu Asp Gly Thr 

1090 1095 1100 

gag cgc gag gtc etc ttc acc acc ggc etc ate cgc cct gtg gec ctg 3421 

Glu Arg Glu Val Leu Phe Thr Thr Gly Leu lie Arg Pro Val Ala Leu 

1105 1110 1115 

gtg gtg g ac aac aca ct g ggc aag ctg ttc tgg gtg gac gcg gac ctg 3469 

Val Val Asp Asn Thr Leu Gly Lys Leu Phe Trp Val Asp Ala Asp Leu 

1120 1125 1130 

aag cgc att gag age tgt gac ctg tea ggg gec aac cgc ctg acc ctg 3517 

Lys Arg lie Glu Ser Cys Asp Leu Ser Gly Ala Asn Arg Leu Thr Leu 

1135 1140 1145 

gag gac gec aac ate gtg cag cct ctg ggc ctg acc ate ctt ggc aag 3565 

Glu Asp Ala Asn lie Val Gin Pro Leu Gly Leu Thr lie Leu Gly Lys 
1150 1155 1160 1165 

cat etc tac tgg ate gac cgc cag cag cag atg ate gag cgt gtg gag 3613 

His Leu Tyr Trp lie Asp Arg Gin Gin Gin Met lie Glu Arg Val Glu 

1170 1175 1180 

aag acc acc ggg gac aag egg act cgc ate cag ggc cgt gtc gee cac 3661 

Lys Thr Thr Gly Asp Lys Arg Thr Arg lie Gin Gly Arg Val Ala His 
1185 1190 1195 
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etc act ggc ate cat gca gtg gag gaa gtc age ctg gag gag ttc tea 3709 
Leu Thr Gly lie His Ala Val Glu Glu Val Ser Leu Glu Glu Phe Ser 

1200 1205 1210 

gee cac cca tgt gec cgt gac aat ggt ggc tgc tec cac ate tgt att 3757 
Ala His Pro Cys Ala Arg Asp Asn Gly Gly Cys Ser His lie Cys lie 

1215 1220 1225 

gee aag ggt gat ggg aca cca egg tgc tea tgc cca gtc cac etc gtg 3805 
Ala Lys Gly Asp Gly Thr Pro Arg Cys Ser Cys Pro Val His Leu Val 
1230 1235 1240 1245 

etc ctg cag aac ctg ctg ace tgt gga gag ccg ccc acc tgc tec ccg 3 853 

Leu Leu Gin Asn Leu Leu Thr Cys Gly Glu Pro Pro Thr Cys Ser Pro 

1250 1255 1260 

gac cag ttt gca tgt gec aca ggg gag ate gac tgt ate ccc ggg gec 3901 
Asp Gin Phe Ala Cys Ala Thr Gly Glu lie Asp Cys lie Pro Gly Ala 

1265 1270 1275 

tgg cgc tgt gac ggc ttt ccc gag tgc gat gac cag age gac gag gag 3 94 9 

Trp Arg Cys Asp Gly Phe Pro Glu Cys Asp Asp Gin Ser Asp Glu Glu 

1280 1285 1290 

ggc tgc ccc gtg tgc tec gee gee cag ttc ccc tgc gcg egg ggt cag 3997 
Gly Cys Pro Val Cys Ser Ala Ala Gin Phe Pro Cys Ala Arg Gly Gin 

1295 1300 1305 

tgt gtg gac ctg cgc ctg cgc tgc gac ggc gag gca gac tgt cag gac 4 04 5 

Cys Val Asp Leu Arg Leu Arg Cys Asp Gly Glu Ala Asp Cys Gin Asp 
1310 1315 1320 1325 

cgc tea gac gag gtg gac tgt gac gee ate tgc ctg ccc aac cag ttc 4093 
Arg Ser Asp Glu Val Asp Cys Asp Ala He Cys Leu Pro Asn Gin Phe 
1330 1335 1340 
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egg tgt gcg age ggc 
Arg Cys Ala Ser Gly 
1345 

ttc ccc gac tgt ate 
Phe Pro Asp Cys lie 
1360 

aag ccg ccc tea gac 
Lys Pro Pro Ser Asp 
1375 

gtc att ggc ate ate 
Val He Gly He He 
1390 

gtg tgc cag cgc gtg 
Val Cys Gin Arg Val 

1410 1415 . 1420 

ttc ccg cac gag tat gtc age ggg ace ccg cac gtg ccc etc aat ttc 4381 
Phe Pro His Glu Tyr Val Ser Gly Thr Pro His Val Pro Leu Asn Phe 

1425 1430 1435 

ata gee ccg ggc ggt tec cag cat ggc ccc ttc aca ggc ate gca tgc 4429 
He Ala Pro Gly Gly Ser Gin His Gly Pro Phe Thr Gly He Ala Cys 

1440 1445 1450 

gga aag tec atg atg age tec gtg age ctg atg ggg ggc egg ggc ggg 4477 
Gly Lys Ser Met Met Ser Ser Val Ser Leu Met Gly Gly Arg Gly Gly 

1455 1460 1465 

gtg ccc etc tac gac egg aac cac gtc aca ggg gee teg tec age age 4525 
Val Pro Leu Tyr Asp Arg Asn His Val Thr Gly Ala Ser Ser Ser Ser 
1470 1475 1480 1485 
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cag tgt gtc etc ate aaa cag cag tgc gac tec 4141 
Gin Cys Val Leu .He Lys Gin Gin Cys Asp Ser 

1350 1355 
gac ggc tec gac gag etc atg tgt gaa ate acc 4189 
Asp Gly Ser Asp Glu Leu Met Cys Glu He Thr 

1365 1370 
gac age ccg gec cac age agt gee ate ggg ccc 4237 
Asp Ser Pro Ala His Ser Ser Ala He Gly Pro 

1380 1385 
etc tct etc ttc gtc atg ggt ggt gtc tat ttt 4285 
Leu Ser Leu Phe Val Met Gly Gly Val Tyr Phe 
1395 1400 1405 

gtg tgc cag cgc tat gcg ggg gee aac ggg ccc 43 3 3 

Val Cys Gin Arg Tyr Ala Gly Ala Asn Gly Pro 
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teg tec age acg aag gec acg ctg tac ccg ccg ate ctg aac ccg ccg 4573 
Ser Ser Ser Thr Lys Ala Thr Leu Tyr Pro Pro lie Leu Asn Pro Pro 

1490 1495 1500 

ccc tec ccg gee acg gac ccc tec ctg tac aac atg gac atg ttc tac 4621 
Pro Ser Pro Ala Thr Asp Pro Ser Leu Tyr Asn Met Asp Met Phe Tyr 

1505 1510 1515 

tct tea aac att ccg gee act gcg aga ccg tac agg ccc tac ate att 4669 
Ser Ser Asn lie Pro Ala Thr Ala Arg Pro Tyr Arg Pro Tyr lie lie 

1520 1525 1530 

cga gga atg gcg ccc ccg acg acg ccc tgc age ace gac gtg tgt gac 4717 
Arg Gly Met Ala Pro Pro Thr Thr Pro Cys Ser Thr Asp Val Cys Asp 

1535 1540 1545 

age gac tac age gec age cgc tgg aag gee age aag tac tac ctg gat 4765 
Ser Asp Tyr Ser Ala Ser Arg Trp Lys Ala Ser Lys Tyr Tyr Leu Asp 
1550 1555 1560 1565 

ttg aac teg gac tea gac ccc tat cca ccc cca ccc acg ccc cac age 4813 
Leu Asn Ser Asp Ser Asp Pro Tyr Pro Pro Pro Pro Thr Pro His Ser 

1570 1575 1580 

cag tac ctg teg gcg gag gac age tgc ccg ccc teg ccc gee ace gag 4861 
Gin Tyr Leu Ser Ala Glu Asp Ser Cys Pro Pro Ser Pro Ala Thr Glu 

1585 1590 1595 

agg age tac ttc cat etc ttc ccg ccc cct ccg tec ccc tgc acg gac 4909 
Arg Ser Tyr Phe His Leu Phe Pro Pro Pro Pro Ser Pro Cys Thr Asp 

1600 1605 1610 

tea tec tgacctcggc cgggccactc tggcttctct gtgcccctgt aaatagtttt 4965 
Ser Ser 
1615 
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aaatatgaac aaagaaaaaa atatatttta tgatttaaaa aataaatata attgggattt 5 025 

taaaaacatg agaaatgtga actgtgatgg ggtgggcagg gctgggagaa ctttgtacag 5085 
tggagaaata tttataaact taattttgta aaaca 5120 



<210> 3 
<211> 1615 
<212> PRT 

<213> Homo sapiens 



<400> 3 

Met Glu Ala Ala Pro Pro Gly Pro Pro Trp Pro Leu Leu Leu Leu Leu 
15 10 15 

Leu Leu Leu Leu Ala Leu Cys Gly Cys Pro Ala Pro Ala Ala Ala Ser 

20 25 30 

Pro Leu Leu Leu Phe Ala Asn Arg Arg Asp Val Arg Leu Val Asp Ala 

35 40 45 

Gly Gly Val Lys Leu Glu Ser Thr lie Val Val Ser Gly Leu Glu Asp 

50 55 60 

Ala Ala Ala Val Asp Phe Gin Phe Ser Lys Gly Ala Val Tyr Trp Thr 
65 70 75 80 

Asp Val Ser Glu Glu Ala lie Lys Gin Thr Tyr Leu Asn Gin Thr Gly 

85 90 95 

Ala Ala Val Gin Asn Val Val lie Ser Gly Leu Val Ser Pro Asp Gly 

100 105 110 

Leu Ala Cys Asp Trp Val Gly Lys Lys Leu Tyr Trp Thr Asp Ser Glu 

115 120 125 

Thr Asn Arg lie Glu Val Ala Asn Leu Asn Gly Thr Ser Arg Lys Val 
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130 135 140 

Leu Phe Trp Gin Asp Leu Asp Gin Pro Lys Ala lie Ala Leu Asp Pro 
145 150 155 160 

Ala His Gly Tyr Met Tyr Trp Thr Asp Trp Gly Glu Thr Pro Arg lie 

165 170 175 

Glu Arg Ala Gly Met Asp Gly Ser Thr Arg Lys lie lie Val Asp Ser 

180 185 190 

Asp lie Tyr Trp Pro Asn Gly Leu Thr lie Asp Leu Glu Glu Gin Lys 

195 200 205 

Leu Tyr Trp Ala Asp Ala Lys Leu Ser Phe lie His Arg Ala Asn Leu 

210 215 220 

Asp Gly Ser Phe Arg Gin Lys Val Val Glu Gly Ser Leu Thr His Pro 
225 230 235 240 

Phe Ala Leu Thr Leu Ser Gly Asp Thr Leu Tyr Trp Thr Asp Trp Gin 

245 250 255 

Thr Arg Ser He His Ala Cys Asn Lys Arg Thr Gly Gly Lys Arg Lys 

260 265 270 

Glu He Leu Ser Ala Leu Tyr Ser Pro Met Asp He Gin Val Leu Ser 

275 280 285 

Gin Glu Arg Gin Pro Phe Phe His Thr Arg Cys Glu Glu Asp Asn Gly 

290 295 300 

Gly Trp Ser His Leu Cys Leu Leu Ser Pro Ser Glu Pro Phe Tyr Thr 
305 310 315 320 

Cys Ala Cys Pro Thr Gly Val Gin Met Gin Asp Asn Gly Arg Thr Cys 

325 330 335 

Lys Ala Gly Ala Glu Glu Val Leu Leu Leu Ala Arg Arg Thr Asp Leu 
340 345 350 
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Arg Arg lie Ser Leu Asp Thr Pro Asp Phe Thr Asp lie Val Leu Gin 

355 360 365 

Val Asp Asp lie Arg His Ala lie Ala lie Asp Tyr Asp Pro Leu Glu 

370 375 380 

Gly Tyr Val Tyr Trp Thr Asp Asp Glu Val Arg Ala lie Arg Arg Ala 
385 390 395 400 

Tyr Leu Asp Gly Ser Gly Ala Gin Thr Leu Val Asn Thr Glu lie Asn 

405 410 415 

Asp Pro Asp Gly lie Ala Val Asp Trp Val Ala Arg Asn Leu Tyr Trp 

420 425 430 

Thr Asp Thr Gly Thr Asp Arg He Glu Val Thr Arg Leu Asn Gly Thr 

435 440 445 

Ser Arg Lys lie Leu Val Ser Glu Asp Leu Asp Glu Pro Arg Ala He 

450 455 460 

Ala Leu His Pro Val Met Gly Leu Met Tyr Trp Thr Asp Trp Gly Glu 
465 470 475 480 

Asn Pro Lys He Glu Cys Ala Asn Leu Asp Gly Gin Glu Arg Arg Val 

485 490 495 

Leu Val Asn Ala Ser Leu Gly Trp Pro Asn Gly Leu Ala Leu Asp Leu 

500 505 510 

Gin Glu Gly Lys Leu Tyr Trp Gly Asp Ala Lys Thr Asp Lys He Glu 

515 520 525 

Val He Asn Val Asp Gly Thr Lys Arg Arg Thr Leu Leu Glu Asp Lys 

530 535 540 

Leu Pro His He Phe Gly Phe Thr Leu Leu Gly Asp Phe He Tyr Trp 
545 550 555 560 

Thr Asp Trp Gin Arg Arg Ser He Glu Arg Val His Lys Val Lys Ala 
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565 570 575 

Ser Arg Asp Val lie He Asp Gin Leu Pro Asp Leu Met Gly Leu Lys 

580 585 590 

Ala Val Asn Val Ala Lys Val Val Gly Thr Asn Pro Cys Ala Asp Arg 

595 600 605 

Asn Gly Gly Cys Ser His Leu Cys Phe Phe Thr Pro His Ala Thr Arg 

610 615 620 

Cys Gly Cys Pro He Gly Leu Glu Leu Leu Ser Asp Met Lys Thr Cys 
625 630 635 640 

He Val Pro Glu Ala Phe Leu Val Phe Thr Ser Arg Ala Ala He His 

645 650 655 

Arg He Ser Leu Glu Thr Asn Asn Asn Asp Val Ala He Pro Leu Thr 

660 665 670 

Gly Val Lys Glu Ala Ser Ala Leu Asp Phe Asp Val Ser Asn Asn His 

675 680 685 

He Tyr Trp Thr Asp Val Ser Leu Lys Asn He Ser Arg Ala Phe Met 

690 695 700 

Asn Gly Ser Ser Val Glu His Val Val Glu Phe Gly Leu Asp Tyr Pro 
705 710 715 720 

Glu Gly Met Ala Val Asp Trp Met Gly Lys Asn Leu Tyr Trp Ala Asp 

725 730 735 

Thr Gly Thr Asn Arg He Glu Val Ala Arg Leu Asp Gly Gin Phe Arg 

740 745 750 

Gin Val Leu Val Trp Arg Asp Leu Asp Asn Pro Arg Ser Leu Ala Leu 

755 760 765 

Asp Pro Thr Lys Gly Tyr He Tyr Trp Thr Glu Trp Gly Gly Lys Pro 
770 775 780 
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Arg lie Val Arg Ala 
785 

Asp Lys Val Gly Arg 
805 

Arg Leu Tyr Trp Thr 
820 

Met Leu Gly Gin Glu 
835 

Phe Gly Leu Thr Gin 
850 

Leu His Ser lie Glu 
865 

Leu lie Gin Gly His 
885 

Ser Ser Arg Gin Asp 
900 

Cys Gly Gin Leu Cys 
915 

Ala Ser His Tyr Thr 
930 

Thr Thr Phe Leu Leu 
945 

Pro Asp Asp Gin His 
965 

Arg Asn Val Lys Ala 
980 

Trp Val Asp Gly Arg 
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Phe Met Asp Gly Thr Asn 
790 795 
Ala Asn Asp Leu Thr lie 
810 

Asp Leu Asp Thr Asn Met 
825 

Arg Val Val lie Ala Asp 
840 

Tyr Ser Asp Tyr lie Tyr 
855 

Arg Ala Asp Lys Thr Ser 
870 875 
Leu Asp Phe Val Met Asp 
890 

Gly Leu Asn Asp Cys Met 
905 

Leu Ala lie Pro Gly Gly 
920 

Leu Asp Pro Ser Ser Arg 
935 

Phe Ser Gin Lys Ser Ala 
950 955 
Ser Pro Asp Leu lie Leu 
970 

lie Asp Tyr Asp Pro Leu 
985 

Gin Asn lie Lys Arg Ala 




PCT/US00/16951 

Cys Met Thr Leu Val 
800 

Asp Tyr Ala Asp Gin 
815 

lie Glu Ser Ser Asn 
830 

Asp Leu Pro His Pro 
845 

Trp Thr Asp Trp Asn 
860 

Gly Arg Asn Arg Thr 
880 

He Leu Val Phe His 
895 

His Asn Asn Gly Gin 
910 

His Arg Cys Gly Cys 
925 

Asn Cys Ser Pro Pro 
940 

He Ser Arg Met He 
960 

Pro Leu His Gly Leu 
975 

Asp Lys Phe He Tyr 
990 

Lys Asp Asp Gly Thr 
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995 1000 1005 

Gin Pro Phe Val Leu Thr Ser Leu Ser Gin Gly Gin Asn Pro Asp Arg 

1010 1015 1020 

Gin Pro His Asp Leu Ser lie Asp He Tyr Ser Arg Thr Leu Phe Trp 
1025 1030 1035 1040 

Thr Cys Glu Ala Thr Asn Thr He Asn Val His Arg Leu Ser Gly Glu 

1045 1050 1055 

Ala Met Gly Val Val Leu Arg Gly Asp Arg Asp Lys Pro Arg Ala He 

1060 1065 1070 

Val Val Asn Ala Glu Arg Gly Tyr Leu Tyr Phe Thr Asn Met Gin Asp 

1075 1080 1085 

Arg Ala Ala Lys He Glu Arg Ala Ala Leu Asp Gly Thr Glu Arg Glu 

1090 1095 1100 

Val Leu Phe Thr Thr Gly Leu He Arg Pro Val Ala Leu Val Val Asp 
1105 1110 1115 1120 

Asn Thr Leu Gly Lys Leu Phe Trp Val Asp Ala Asp Leu Lys Arg He 

1125 1130 1135 

Glu Ser Cys Asp Leu Ser Gly Ala Asn Arg Leu Thr Leu Glu Asp Ala 

1140 1145 1150 

Asn He Val Gin Pro Leu Gly Leu Thr He Leu Gly Lys His Leu Tyr 

1155 1160 1165 

Trp He Asp Arg Gin Gin Gin Met He Glu Arg Val Glu Lys Thr Thr 

1170 1175 1180 

Gly Asp Lys Arg Thr Arg He Gin Gly Arg Val Ala His Leu Thr Gly 
1185 1190 1195 1200 

He His Ala Val Glu Glu Val Ser Leu Glu Glu Phe Ser Ala His Pro 
1205 1210 1215 
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Cys Ala Arg Asp Asn Gly Gly Cys Ser His lie Cys lie Ala Lys Gly 

1220 1225 1230 

Asp Gly Thr Pro Arg Cys Ser Cys Pro Val His Leu Val Leu Leu Gin 

1235 1240 1245 

Asn Leu Leu Thr Cys Gly Glu Pro Pro Thr Cys Ser Pro Asp Gin Phe 

1250 1255 1260 

Ala Cys Ala Thr Gly Glu lie Asp Cys lie Pro Gly Ala Trp Arg Cys 
1265 1270 1275 1280 

Asp Gly Phe Pro Glu Cys Asp Asp Gin Ser Asp Glu Glu Gly Cys Pro 

1285 1290 1295 

Val Cys Ser Ala Ala Gin Phe Pro Cys Ala Arg Gly Gin Cys Val Asp 

1300 1305 1310 

Leu Arg Leu Arg Cys Asp Gly Glu Ala Asp Cys Gin Asp Arg Ser Asp 

1315 1320 1325 

Glu Val Asp Cys Asp Ala lie Cys Leu Pro Asn Gin Phe Arg Cys Ala 

1330 1335 1340 

Ser Gly Gin Cys Val Leu lie Lys Gin Gin Cys Asp Ser Phe Pro Asp 
1345 1350 1355 1360 

Cys lie Asp Gly Ser Asp Glu Leu Met Cys Glu lie Thr Lys Pro Pro 

1365 1370 1375 

Ser Asp Asp Ser Pro Ala His Ser Ser Ala lie Gly Pro Val He Gly 

1380 1385 1390 

He He Leu Ser Leu Phe Val Met Gly Gly Val Tyr Phe Val Cys Gin 

1395 1400 1405 

Arg Val Val Cys Gin Arg Tyr Ala Gly Ala Asn Gly Pro Phe Pro His 

1410 1415 1420 

Glu Tyr Val Ser Gly Thr Pro His Val Pro Leu Asn Phe He Ala Pro 
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1425 1430 1435 1440 

Gly Gly Ser Gin His Gly Pro Phe Thr Gly lie Ala Cys Gly Lys Ser 

1445 1450 1455 

Met Met Ser Ser Val Ser Leu Met Gly Gly Arg Gly Gly Val Pro Leu 

1460 1465 1470 

Tyr Asp Arg Asn His Val Thr Gly Ala Ser Ser Ser Ser Ser Ser Ser 

1475 1480 1485 

Thr Lys Ala Thr Leu Tyr Pro Pro He Leu Asn Pro Pro Pro Ser Pro 

1490 1495 1500 

Ala Thr Asp Pro Ser Leu Tyr Asn Met Asp Met Phe Tyr Ser Ser Asn 
1505 1510 1515 1520 

He Pro Ala Thr Ala Arg Pro Tyr Arg Pro Tyr He He Arg Gly Met 

1525 1530 1535 

Ala Pro Pro Thr Thr Pro Cys Ser Thr Asp Val Cys Asp Ser Asp Tyr 

1540 1545 1550 

Ser Ala Ser Arg Trp Lys Ala Ser Lys Tyr Tyr Leu Asp Leu Asn Ser 

1555 1560 1565 

Asp Ser Asp Pro Tyr Pro Pro Pro Pro Thr Pro His Ser Gin Tyr Leu 

1570 1575 1580 

Ser Ala Glu Asp Ser Cys Pro Pro Ser Pro Ala Thr Glu Arg Ser Tyr 
1585 1590 1595 1600 

Phe His Leu Phe Pro Pro Pro Pro Ser Pro Cys Thr Asp Ser Ser 
1605 1610 1615 



<210> 4 
<211> 1615 
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<212> PRT 

<213> Homo sapiens 



<400> 4 

Met Glu Ala Ala Pro Pro Gly Pro Pro Trp Pro Leu Leu Leu Leu Leu 

15 10 15 

Leu Leu Leu Leu Ala Leu Cys Gly Cys Pro Ala Pro Ala Ala Ala Ser 

20 25 30 

Pro Leu Leu Leu Phe Ala Asn Arg Arg Asp Val Arg Leu Val Asp Ala 

35 40 45 

Gly Gly Val Lys Leu Glu Ser Thr lie Val Val Ser Gly Leu Glu Asp 

50 55 60 

Ala Ala Ala Val Asp Phe Gin Phe Ser Lys Gly Ala Val Tyr Trp Thr 
65 70 75 80 

Asp Val Ser Glu Glu Ala lie Lys Gin Thr Tyr Leu Asn Gin Thr Gly 

85 90 95 

Ala Ala Val Gin Asn Val Val lie Ser Gly Leu Val Ser Pro Asp Gly 

100 105 110 

Leu Ala Cys Asp Trp Val Gly Lys Lys Leu Tyr Trp Thr Asp Ser Glu 

115 120 125 

Thr Asn Arg lie Glu Val Ala Asn Leu Asn Gly Thr Ser Arg Lys Val 

130 135 140 

Leu Phe Trp Gin Asp Leu Asp Gin Pro Lys Ala lie Ala Leu Asp Pro 
145 150 155 160 

Ala His Gly Tyr Met Tyr Trp Thr Asp Trp Val Glu Thr Pro Arg lie 

165 170 175 

Glu Arg Ala Gly Met Asp Gly Ser Thr Arg Lys lie lie Val Asp Ser 
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180 185 190 

Asp lie Tyr Trp Pro Asn Gly Leu Thr lie Asp Leu Glu Glu Gin Lys 

195 200 205 

Leu Tyr Trp Ala Asp Ala Lys Leu Ser Phe lie His Arg Ala Ash Leu 

210 215 220 

Asp Gly Ser Phe Arg Gin Lys Val Val Glu Gly Ser Leu Thr His Pro , 
225 230 235 240 

Phe Ala Leu Thr Leu Ser Gly Asp Thr Leu Tyr Trp Thr Asp Trp Gin 

245 250 255 

Thr Arg Ser lie His Ala Cys Asn Lys Arg Thr Gly Gly Lys Arg Lys 

260 265 270 

Glu lie Leu Ser Ala Leu Tyr Ser Pro Met Asp lie Gin Val Leu Ser 

275 280 285 

Gin Glu Arg Gin Pro Phe Phe His Thr Arg Cys Glu Glu Asp Asn Gly 

290 295 300 

Gly Trp Ser His Leu Cys Leu Leu Ser Pro Ser Glu Pro Phe Tyr Thr 
305 310 315 320 

Cys Ala Cys Pro Thr Gly Val Gin Met Gin Asp Asn Gly Arg Thr Cys 

325 330 335 

Lys Ala Gly Ala Glu Glu Val Leu Leu Leu Ala Arg Arg Thr Asp Leu 

340 345 350 

Arg Arg lie Ser Leu Asp Thr Pro Asp Phe Thr Asp lie Val Leu Gin 

355 360 365 

Val Asp Asp lie Arg His Ala lie Ala lie Asp Tyr Asp Pro Leu Glu 

370 375 380 

Gly Tyr Val Tyr Trp Thr Asp Asp Glu Val Arg Ala lie Arg Arg Ala 
385 390 395 400 
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Tyr Leu Asp Gly Ser Gly Ala Gin Thr Leu Val Asn Thr Glu He Asn 

405 410 415 

Asp Pro Asp Gly He Ala Val Asp Trp Val Ala Arg Asn Leu Tyr Trp 

420 425 430 

Thr Asp Thr Gly Thr Asp Arg lie Glu Val Thr Arg Leu Asn Gly Thr 

435 440 445 

Ser Arg Lys He Leu Val Ser Glu Asp Leu Asp Glu Pro Arg Ala lie 

450 455 460 

Ala Leu His Pro Val Met Gly Leu Met Tyr Trp Thr Asp Trp Gly Glu 
465 470 475 480 

Asn Pro Lys He Glu Cys Ala Asn Leu Asp Gly Gin Glu Arg Arg Val 

485 490 495 

Leu Val Asn Ala Ser Leu Gly Trp Pro Asn Gly Leu Ala Leu Asp Leu 

500 505 510 

Gin Glu Gly Lys Leu Tyr Trp Gly Asp Ala Lys Thr Asp Lys lie Glu 

515 520 525 

Val He Asn Val Asp Gly Thr Lys Arg Arg Thr Leu Leu Glu " Asp Lys 

530 535 540 

Leu Pro His He Phe Gly Phe Thr Leu Leu Gly Asp Phe He Tyr Trp 
545 550 555 560 

Thr Asp Trp Gin Arg Arg Ser He Glu Arg Val His Lys Val Lys Ala 

565 570 575 

Ser Arg Asp Val He He Asp Gin Leu Pro Asp Leu Met Gly Leu Lys 

580 585 590 

Ala Val Asn Val Ala Lys Val Val Gly Thr Asn Pro Cys Ala Asp Arg 

595 600 605 

Asn Gly Gly Cys Ser His Leu Cys Phe Phe Thr Pro His Ala Thr Arg 
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Cys Gly Cys Pro 
625 

He Val Pro Glu 

Arg He Ser Leu 
660 

Gly Val Lys Glu 
675 

He Tyr Trp Thr 
690 

Asn Gly Ser Ser 
705 

Glu Gly Met Ala 

Thr Gly Thr Asn 
740 

Gin Val Leu Val 
755 

Asp Pro Thr Lys 
770 

Arg He Val Arg 
785 

Asp Lys Val Gly 

Arg Leu Tyr Trp 
820 




615 

He Gly Leu Glu 
630 

Ala Phe Leu Val 
645 

Glu Thr Asn Asn 

Ala Ser Ala Leu 
680 

Asp Val Ser Leu 
695 

Val Glu His Val 
710 

Val Asp Trp Met 
725 

Arg He Glu Val 

Trp Arg Asp Leu 
760 

Gly Tyr He Tyr 
775 

Ala Phe Met Asp 
790 

Arg Ala Asn Asp 
805 

Thr Asp Leu Asp 
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620 

Leu Leu Ser Asp 
635 

Phe Thr Ser Arg 
650 

Asn Asp Val Ala 
665 

Asp Phe Asp Val 

Lys Asn He Ser 
700 

Val Glu Phe Gly 
715 

Gly Lys Asn Leu 
730 

Ala Arg Leu Asp 
745 

Asp Asn Pro Arg 

Trp Thr Glu Trp 
780 

Gly Thr Asn Cys 
795 

Leu Thr He Asp 
810 

Thr Asn Met He 
825 
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Met Lys Thr Cys 
640 

Ala Ala He His 
655 

He Pro Leu Thr 
670 

Ser Asn Asn His 
685 

Arg Ala Phe Met 

Leu Asp Tyr Pro 
720 

Tyr Trp Ala Asp 
735 

Gly Gin Phe Arg 
750 

Ser Leu Ala Leu 
765 

Gly Gly Lys Pro 

Met Thr Leu Val 
800 

Tyr Ala Asp Gin 
815 

Glu Ser Ser Asn 
830 
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Met Leu Gly Gin Glu Arg Val Val lie Ala Asp Asp Leu Pro His Pro 

835 840 845 

Phe Gly Leu Thr Gin Tyr Ser Asp Tyr lie Tyr Trp Thr Asp Trp Asn 

850 855 860 

Leu His Ser lie Glu Arg Ala Asp Lys Thr Ser Gly Arg Asn Arg Thr 
865 870 875 880 

Leu lie Gin Gly His Leu Asp Phe Val Met Asp lie Leu Val Phe His 

885 890 895 

Ser Ser Arg Gin Asp Gly Leu Asn Asp Cys Met His Asn Asn Gly Gin 

900 905 910 

Cys Gly Gin Leu Cys Leu Ala lie Pro Gly Gly His Arg Cys Gly Cys 

915 920 925 

Ala Ser His Tyr Thr Leu Asp Pro Ser Ser Arg Asn Cys Ser Pro Pro 

930 935 940 

Thr Thr Phe Leu Leu Phe Ser Gin Lys Ser Ala lie Ser Arg Met lie 
945 950 955 960 

Pro Asp Asp Gin His Ser Pro Asp Leu lie Leu Pro Leu His Gly Leu 

965 970 975 

Arg Asn Val Lys Ala lie Asp Tyr Asp Pro Leu Asp Lys Phe lie Tyr 

980 985 990 

Trp Val Asp Gly Arg Gin Asn lie Lys Arg Ala Lys Asp Asp Gly Thr 

995 1000 1005 

Gin Pro Phe Val Leu Thr Ser Leu Ser Gin Gly Gin Asn Pro Asp Arg 

1010 1015 1020 

Gin Pro His Asp Leu Ser lie Asp lie Tyr Ser Arg Thr Leu Phe Trp 
1025 1030 1035 1040 

Thr Cys Glu Ala Thr Asn Thr lie Asn Val His Arg Leu Ser Gly Glu 
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1045 1050 1055 

Ala Met Gly Val Val Leu Arg Gly Asp. Arg Asp Lys Pro Arg Ala lie 

1060 1065 1070 

Val Val Asn Ala Glu Arg Gly Tyr Leu Tyr Phe Thr Asn Met Gin Asp 

1075 1080 1085 

Arg Ala Ala Lys He Glu Arg Ala Ala Leu Asp Gly Thr Glu Arg Glu 

1090 1095 1100 

Val Leu Phe Thr Thr Gly Leu He Arg Pro Val Ala Leu Val Val Asp 
1105 1110 1115 1120 

Asn Thr Leu Gly Lys Leu Phe Trp Val Asp Ala Asp Leu Lys Arg He 

1125 1130 1135 

Glu Ser Cys Asp Leu Ser Gly Ala Asn Arg Leu Thr Leu Glu Asp Ala 

1140 1145 1150 

Asn He Val Gin Pro Leu Gly Leu Thr He Leu Gly Lys His Leu Tyr 

1155 1160 1165 

Trp He Asp Arg Gin Gin Gin Met He Glu Arg Val Glu Lys Thr Thr 

1170 1175 1180 

Gly Asp Lys Arg Thr Arg He Gin Gly Arg Val Ala His Leu Thr Gly 
1185 1190 1195 1200 

He His Ala Val Glu Glu Val Ser Leu Glu Glu Phe Ser Ala His Pro 

1205 1210 1215 

Cys Ala Arg Asp Asn Gly Gly Cys Ser His He Cys He Ala Lys Gly 

1220 1225 1230 

Asp Gly Thr Pro Arg Cys Ser Cys Pro Val His Leu Val Leu Leu Gin 

1235 1240 1245 

Asn Leu Leu Thr Cys Gly Glu Pro Pro Thr Cys Ser Pro Asp Gin Phe 
1250 1255 1260 
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Ala Cys Ala Thr Gly Glu lie Asp Cys lie Pro Gly Ala Trp Arg Cys 
1265 1270 1275 1280 

Asp Gly Phe Pro Glu Cys Asp Asp Gin Ser Asp Glu Glu Gly Cys Pro 

1285 1290 1295 

Val Cys Ser Ala Ala Gin Phe Pro Cys Ala Arg Gly Gin Cys Val Asp 

1300 1305 1310 

Leu Arg Leu Arg Cys Asp Gly Glu Ala Asp Cys Gin Asp Arg Ser Asp 

1315 1320 1325 

Glu Val Asp Cys Asp Ala He Cys Leu Pro Asn Gin Phe Arg Cys Ala 

1330 1335 1340 

Ser Gly Gin Cys Val Leu He Lys Gin Gin Cys Asp Ser Phe Pro Asp 
1345 1350 1355 1360 

Cys He Asp Gly Ser Asp Glu Leu Met Cys Glu He Thr Lys Pro Pro 

1365 1370 1375 

Ser Asp Asp Ser Pro Ala His Ser Ser Ala He Gly Pro Val lie Gly 

1380 1385 1390 

He He Leu Ser Leu Phe Val Met Gly Gly Val Tyr Phe Val Cys Gin 

1395 1400 1405 

Arg Val Val Cys Gin Arg Tyr Ala Gly Ala Ash Gly Pro Phe Pro His 

1410 1415 1420 

Glu Tyr Val Ser Gly Thr Pro His Val Pro Leu Asn Phe He Ala Pro 
1425 1430 1435 1440 

Gly Gly Ser Gin His Gly Pro Phe Thr Gly He Ala Cys Gly Lys Ser 

1445 1450 1455 

Met Met Ser Ser Val Ser Leu Met Gly Gly Arg Gly Gly Val Pro Leu 

1460 1465 1470 

Tyr Asp Arg Asn His Val Thr Gly Ala Ser Ser Ser Ser Ser Ser Ser 
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1475 1480 1485 

Thr Lys Ala Thr Leu Tyr Pro Pro lie Leu Asn Pro Pro Pro Ser Pro 

1490 1495 1500 

Ala Thr Asp Pro Ser Leu Tyr Asn Met Asp Met Phe Tyr Ser Ser Asn 
1505 1510 1515 1520 

lie Pro Ala Thr Ala Arg Pro Tyr Arg Pro Tyr lie lie Arg Gly Met 

1525 1530 1535 

Ala Pro Pro Thr Thr Pro Cys Ser Thr Asp Val Cys Asp Ser Asp Tyr 

1540 1545 1550 

Ser Ala Ser Arg Trp Lys Ala Ser Lys Tyr Tyr Leu Asp Leu Asn Ser 

1555 1560 1565 

Asp Ser Asp Pro Tyr Pro Pro Pro Pro Thr Pro His Ser Gin Tyr Leu 

1570 1575 1580 

Ser Ala Glu Asp Ser Cys Pro Pro Ser Pro Ala Thr Glu Arg Ser Tyr 
1585 1590 1595 1600 

Phe His Leu Phe Pro Pro Pro Pro Ser Pro Cys Thr Asp Ser Ser 
1605 1610 1615 



<210> 5 

<211> 3096 

<212> DNA 

<213> Homo sapiens 

<400> 5 

catcttctca cacgatctct cgcttcgcac tccttccttt gattggtttt caccatttac 60 
tcagacgacg gtccttcttc gatctttgca cattcttcta tcatctacta ccttcatacc 120 
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cagctccgtc 


ccctaatatt 


catgcgcgga 


tggcccattc 


cgtggtgaaa 


attcccttct 


180 


actctgctaa 


tctgctgttc 


tctctccctc 


ccgtcgggtt 


ctgctcctgc 


cacgttctcc 


240 


cctctcccca 


ccaaaggctg 


ggttttcttt 


gtcagggctc 


ctttcccctt 


tggaagaagg 


300 


ggggctgtat 


ggccttggtg 


cgaggccctc 


cagtgacagg 


atcccccatc 


acccagagtt 


360 


ccacaggccc 


tggtagggag 


gagggggagc 


agaagaggag 


gtgccatctt 


tgcctgctgg 


420 


ggaagggcag 


gggccaccca 


cacagagctc 


tcccatttgc 


tgtggaccct 


ggggccactg 


480 


cccagttcct 


tccaaaggaa 


agccagctcc 


ccaggtggtg 


ggagagtgat 


atggcttcct 


540 


cttaaactta 


gggaattgag 


tgtgtggttg 


cttctaagtg 


ccttagaagc 


cgggagcggc 


600 


tcctggaaag 


agcctgcctg 


ccacagcggg 


ccttaccctg 


gctgtgccca 


cagatgtccc 


660 


tggggcctgc 


cgctcctgcc 


cggctctcct 


ggcctccccc 


ggtgtgggtt 


gggaaaagca 


720 


cagcaaatta 


aaaaacacct 


ccatctctgg 


cctttgaaga 


atgcatctga 


acagccgaga 


780 


gtgtaaaccg 


tggtgaaatg 


tggtctttcc 


agtttgggga 


gaagcagggc 


agagctgggg 


840 


cttttgtacc 


cagggtttcc 


aagagctcct 


gcctccctcg 


gctgggctgg 


ccagggcccc 


900 


ccgctgggac 


ctccagctgt 


aatagggaag 


gttttactgg 


gttgctggcc 


actgtggact 


960 


gcccctaagg 


gcaggtatgc 


ctgcctttac 


ccgggttccc 


ctcctgcctg 


gaagatacag 


1020 


cccatgggag 


gcctgttgtc 


tgtgggatcc 


tccagcatca 


gagacactgg 


ggccagcgtc 


1080 


tgcctggtga 


ggtgcaggcc 


tggcaggccc 


ggtcccccac 


ctgcttgagc 


acccacggtg 


1140 


gtgggggctc 


gctgcctccc 


gagacaatct 


atgtcattgt 


tgtccaagga 


agctaattta 


1200 


gagtagaaag 


ttccgtgtcc 


agtcccactc 


tgtgcgtgtg 


ttagcagggg 


actctcgggc 


1260 


cggagctggg 


tccaccctgg 


tagggggact 


tcatggggcc 


tgggcgacag 


cactgtgtat 


1320 


ttgtgtgtgt 


gtgtgtttgt 


gtgtgtgtgt 


gtctgaggag 


gtggaccagt 


ttctcaaaag 


1380 


gcctgtgacc 


ccaagaacca 


aggaatttca 


gcctgggtgg 


atcacacctt 


cactggtgag 


1440 


tgggacaagc 


tgggggccct 


cgccacagga 


gcagccaggg 


catggggcac 


agttggcctc 


1500 


attcacaaaa 


tgggagtata 


agtgatccct 


gctctggcgg 


ccaggacgat 


gagtgggaac 


1560 


acaccgtgtg 


ggggctgcct 


ggcctgggtg 


tgccgcgggt 


gtccttgttg 


gtgatggttc 


1620 


cacctgcttg 


tgccaccagt 


gccctctggg 


tctcacacac 


aactctcttc 


ccagcgaagg 


1680 


cccctcctgc 


cctcaggcct 


cagtgctgct 


tccgtctcgg 


aaggccccag 


gagctcctgc 


1740 
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atcctgggcg 


tgattcctgt 


gtgcctgcag 


accccctcgc 


ggctgccatc 


tcatcctttg 


1800 


gtgcacctgt 


tggccagacc 


tcctggtagc 


gggtgctgca 


ctcccctgaa 


tgtgccgggg 


1860 


cctgggggca 


gggacctggg 


ctcctccctc 


actgagtgga 


gggaactcag 


tgtcttggag 


1920 


ttggggtgcc 


tgcaggctgg 


gtggtgcagg 


tgaaatgcag 


acctctcagc 


tggtgttcca 


1980 


gagcagctgc 


cttcccccgc 


ccgagggact 


tcacccgcag 


cccagtcagg 


ggtggcgcct 


2040 


gggtgcatcg 


cccgcaggct 


. gggtaggggt 


ggagcctggg 


tggccctgcc 


tgtgagctgc 


2100 


atagttgtcg 


cctttgaccc 


tgagttttct 


tcgttatctg 


tttggacctg tttggggcag 


2160 


gcaggggatg 


agatctgaag 


ataaatgcct 


tagctgtgac 


catctccttt 


tgtgagaggt 


2220 


caatgtccag 


ttccgctgca 


gttataacat 


cccatttttt 


gatttctttt 


tattttttcc 


2280 


tttttctttt 


tgagatggag 


tctcgctctg 


tcacccaggc 


tggagtgcaa 


tggggtgacc 


2340 


tcagctcact 


gcaacctcca 


cttctcgggt 


tcaagtgatt 


ctcctgcctc 


agcctcctga 


2400 


ctagcagggg 


ttacaggcgt 


gagccaccac 


gcccagctaa 


tttttgtatt 


tttagtagag 


2460 


gcaaggtttc 


gtcatgttgg 


ccaggctggt 


ctcaaactcc 


tggccttaag 


tgatctgccc 


2520 


gcctcggcct 


cccaaagtgc 


tgagatgaca 


ggtgtgagcc 


accgtgcccg 


gcccagaact 


2580 


ctttaattcc 


cacctgaaac 


ttgccgcctt 


aagcaggtcc 


ccagtctccc 


tcccctagtc 


2640 


cctggtccca 


ccattctgct 


ttctgtctca 


atgaatttgc 


ctaccgtaag 


tacctcatat 


2700 


aaattgaatc 


ataaagtatt 


tgtcttttta 


tatctggctt 


atttcactta 


gcataacatt 


2760 


cttaagtttc 


atccatgttg 


tagcatgtgt 


cagaatctct 


ctcttttttt 


tttttttttt 


2820 


tttttttttt 


ttttgcagac 


agagtctcgc 


tctgtcatct 


agactggagt 


tcagtggcac 


2880 


gatctcggtt 


cactgcaaca 


tctgcctcct 


gggtccaagc 


aattctcctg 


cctcagcctc 


2940 


cttagcagct 


ggaactacag 


gcgcgtgcca 


ccatgccttg 


ctaatttttg 


tatttttatg 


3000 


tggaggcagg 


gtttcaccat 


cttggccagg 


ctggtctcga 


attcctggtc 


ttcaccacgg 


3060 


gggcccgaag 


gacccgggca 


aagcgtggag 


gggagg 






3096 
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<220> 

<221> unsure 

<222> (12044) , (12489) , (26433) , (26434) , (26435) , (26436) , (26439) , (26441) 
<223> Identity of nucleotide sequences at the above locations are unknown. 



<400> 6 

gaagaccaag ggcacacagc gaggcagttt cagggcgggc agcctggggc cccacggggc 60 

ggccccggac acttgttctc acctgtggag ggcagagaag ggaacaggga gagaagtggc 120 

cggctgggag tggaggtggg tttgaggttt tactgtaaac taaatgtgta ccctctacct 18 0 

tagttatgaa ttatgagaca cgaagactgc gaaacagaca cactcctcta aaagtgcctc 240 

taggctgaca gggagaaagt cccgccaggc tcccagacgc cacctttgag tccttcaaca 300 

agcccgccag ggcctcttgc ccaccggtgt cagctcagcc actgaaccct ccaggaagaa 3 60 

gacgtgctgg taggagaaga atctcaccca ggcacagcct ggaaggggca cagaaggggc 42 0 

tccggaacca gcaagcccaa gttggaactc ccagtctgct actttctaga acgactgtgc 480 

ccttggcggg tctaagtaga acctctccgc gcactctttc ctcctttgta aagtggggac 540 

agcaatggcc accttgcagg ttcagagagg gcttgcagta cctcacagaa ctgagtgccc 600 

gtgaacgtgt gtgttcctcc agatttgtga cagctttgcc aggctggagt caggctgaac 660 

gcctctgccc tcatggggtt tatattctag gaagaccaac aaaaacaaga agacggaaaa 72 0 

ttaaaacaac aaaagcccca ttgacaggcc gtgaagaatg ccatgaaaaa tgaatggcgt 780 

tgtgctgcag tctttgggga aacgggctta cggaaagaag gacacttgag ctgctaccaa 84 0 

tgagcagccg tccggtggga gggcagttca ggaagagcag acatccactg aggaggcgct 900 

ggggcagagg gcagcctggt cgctggattc gggggaggaa ccacatcagg ccatgagctg 960 

gagctggtgg tagaatgtac aggagaggcc agccagggcc agctcatgtc agacctcaag 102 0 

cggggaagat gaatcgagaa tgcaccccac gagcaatggg aagccagtct acgatttaag 1080 

cagcaaaaat attttccctt cttccaccct gcatccagct ctaccagcac agcctggggt 114 0 
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tctattttca 


agatagaata 


gacccagact 


cccagctctt 


cttacacttc 


tactactgcc 


1200 


acctgtcacc 


cactcatgcg 


fcccccacttg 


cagcctcgac 


ccccttccac 


ctgatctcat 


1260 


ggcagccagg 


gaagctccag 


ggctcgtgag 


ggctgccatc 


tcaggaaaga agcaaaagcc 


1320 


ttcggcacct 


gcagggcctg 


ctccaaccac 


acttcttcct 


tgacctctca gcttccttag 


1380 


ccactccctt 


cccacatctc 


accctgctcc 


agccacagtg 


gtgtctctgt 


gggttctcaa 


1440 


acacaccagg 


tgcactcctg 


cctcagggcc 


tttgtgcttg 


ctgttctctg 


ct gggactct 


1500 


tttttttttt 


tttttttttg 


agacagggtc 


tcactctgtg 


gcccaggctg 


gagtgtagtg 


1560 


gtgtgatcgt 


agctcattgc 


aacctcaaac 


tcctgggctc 


aagcaatcct 


cccacctcag 


1620 


cctctcaagt 


agttagcttt 


tgttgttttg 


ttttgagatg 


ggatctcact 


ctgttgccca 


1680 


ggctggagtg 


cagtggggca 


atcttggctc 


accacaacct 


ctgcctccca 


ggctcaagca 


1740 


attctcctgc 


ctcagcctcc 


caagtagctg 


ggattacagg 

ZJ ZJ ZJ ZJ 


catgtgccac 


cacgcccagc 


1800 


ttatttttgt 


atttttagta 


gagacagggt 


ttcaccatgt 


tggtctggct 


ggtcttgaac 


1860 


tcctggcctc 


agatgatcca 


cctgcctcgg 


cctcccaaag 


tgctgggatg 


acaggcatga 


1920 


gcctgtctct 


agtagttagg 


actacagaga 


ggggccatca 


tgcctggtga 


tcctcccacc 


1980 


ttttctgctc 


caactctttc 


accccactta 


qcctcqtqqc 

ZJ w «J ZJ ZJ 


tcactctctt 


acctcttcag 


2040 


ctcctcagtc 


aggcctgagg 


acccctgttg 


aaaattgcaa 


accacacccc 


ccaccaccac 


2100 


cacccactat 


tgccagcact 


ttctactcca 


tttctctgct 


ttacttttct 


cctttgtact 


2160 


catcaccacc 


tgactcatta 


catgtttacg 


tatctttctt 


ctctccacta 


gcatggaagc 


2220 


tccaggagag cagagagtgt 


agttttattc 


cctgatgtgt 


ttcctgtgcc 


cgtaccaggg 


2280 


cctagcacac 


agtaggtgct 


cagtaaatgt 


gtgttggatg 


aacaaataca 


gtgaaaggat 


2340 


ctgatctaca 


tttataaaga 


aggcactctg 


qctqctqaqt 

ZJ Z3 ZJ ZJ 


ggggatgaga 


ctgtcaggag 


2400 


gaaagaggcc 


cctgtggggg 


cctggccagc 


aqqtqqqtac 

ZJ ZJ ZD ZJ ZJ 


aatggtagca gccaggagag 


2460 


agggcctctt 


ggactcaagt 


ggatggggcc 


tqctcaqqqc 

J ZJ ZJ ZJ 


tccggccaca 


ggaacaaagg 


2520 


gaagggggcc 


caggatggcc 


tgtcatagag 


gacacattac 


aactggccca 


aagttcaagt 


2580 


caggtttcta 


aatttgggaa 


gggatacaga 


aaaactaaag 


actctactgg 


acagtcagtt 


2640 


attgaaatga ttacatagaa 


aatgtaccaa 


gaattaaaaa 


aaaaaaaaaa 


aagcattatg 


2700 


aaggggccac 


cagagactcc 


cagagaggaa 


agggactatg 


ggctggatgc 


ggtgactcac 


2760 
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acctataatc 


ccagcacttt 


qqqaqqccqa 


qgagggtgga 

3 3 ZJ ZJ ZJ ZJ Zj 


tcacgaggtc 


aggagttcaa 


2820 


aaccagccta 


ggcaacatgg 


taaaaccccc 


gtttctacta 


aaaatacaaa 


aaattagctg 


2880 


era c a t qqc acr 


catgtgcctg 


taatcccagc 


tactcqqqaq 


qctqagqcaq 

ZJ ZJ ZJ Zj ZJ 


qaqaqttqct 

ZJ ™"~ ^^3 3 


2940 


agaacccagg 


aggcagaggt 


tgcagtgagc 


cgagattgag 


ccactatgct 


ccagcttggg 

3 V w ZJ Zj ZJ 


3000 


cgacagagca 


agactccgtc 


tctaaaaaaa 


agaaaaaaaa 


ggccagatga 

zj zj 3 zj 


ggtggctcat 

<J ■ — ' ZJ Z3> 


3060 


gectgtaate 


ccagcacttt 


qqqaqqccqa 


qqtqqqtqqa 

Dj w 333 w 3 3 ^ 


tcacgaggtc 


aggagatcga 


3120 


gaccatcctg 


gctaacatgg 


tgaaactcca 


tctctactta 


aaatacaaaa 


aattagcegg 


3180 


QC Q t CTQ t OTQ C 

ZD ~J ZJ ZJ *~ ZJ ZD ^~ 


qqqcacctqt 

333 3 w 


agtcccagct 


acttqqqaqq 


ctqaqqcaqq 

^Zj ZjZJ ZJZj 


agaatggcgt 


3240 


gaacctggga 


ggeggagett 


geagtgagee 


gagattgege 


cactgcactc 


catccagcct 


3300 


qggcgacaqa 

333 3 3 


gttagactcc 


gtctcaaaaa 


aaaaaaaaaa 


aaaaaaatta 


gctgattagt 


3360 


tqqqcttqqt 

333 vv ^ v ^?3 *•* 


qqcqqqcqcc 

33 V 333 IJ W 


tgtaatccca 


actacteggg 


agqetgagge 

33 ZJ ZJ ZJ 


gggagaatca 


3420 


cttgaacccg 


qqacrqeacfag 


gttgcaatga 


gecgatatea 


cgccactaca 


ctccagcctg 


3480 


crcrcaacaqaq 

33^3"-^"3 »3 


caagactcca 


tctcaaaaaa 


gaaaaaaaaa 


aaqaaaqqqq 


ctqtqctqtq 

J 3 ZJ 3 


3540 


gcctgggacc 


caaagcacac 


tactgeaagg 


tcccaqqqtq 


cctgactcca 


accggagcct 


3600 


tgagaacatt 


catttgeaaa 


gaatgaatta 


aaattcagca 


ctattttatt 


ctgeaggatt 


3660 


ccagcacccc 


aaggacagtc 


atttttagac 


ccttcagtaa 


cgtaataagt 


aaceggagga 


3720 


tqtqctqaac 


ttccacttcc 


ecagaeggtt 


gcctgtcaca 


gctcatcagg 


ccaacaaact 


3780 


tttcttaggc 


ctcaaatttg 


gaaatgttca 


ctctcagttc 


gttccttaga 


tgcaagtcca 


3840 


tcccaatgaa 


qtaacaqqqq 

3 **3 3 3 3 


ctcagcacct 


gtccaatctc 


attgettccg 


qqqa c aqqqq 


3900 


cccatgagga 


tgtcgtttca 


gcccggtgac 


acttgggcaa 


agtgcctttt 


ggtttccctc 


3960 


ccaggctqqa 

ZJ ZJ 33 


acqtgctgqc 

3 "~ j ~" 3 3 w 


tctgtgaagt 


tacgetggge 

ZJ 333 


acaagagccc 


cccccaaccc 


4020 


qqcaqqactq 

33* rf "^'33* — ^ 3 


actqctqtqq 

3 3 33 


teagaqqege 

w 3 33 3 


ccctgqggct 


ttqqqaqcca 


cagaatcttc 


4080 




cac ccrcraQQa 


qqccccaqtq 


agagtgecca 


ctgccaggct 


cattcctcag 


4140 


gctgccgcag 


gcctctcccc 


aaaacaggca 


atgettctea 


gcaacctgcc 


ccaggagcag 


4200 


gecagggaag 


gccgccatcg 


gectacagtg 


ctgggctctg 


gagggcttgg 


ttggtaacag 


4260 


gccatggttt 


etatgageca 


gctggggtgt 


gaaggacaca 


ggctggattc 


acctctctgg 


4320 


gcctcagttt 


ctgcattcaa 


aaagtgggaa 


tcatgatatc 


tgctctattt 


cttatctctc 


4380 
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agtgctgatg 


tgaacctcca 


ataagacttt 


taaaaatact 


ctttctacct 


tacttttatt 


4440 


tttcatttat 


tttaagataa 


tgtctagctg 


tctcacccag 


gctggagtgc 


agtggtgtga 


4500 


ttacggctca 


ctacagcctt 


aacctcccag 


gctcaagtga 


tcctcctacc 


acagcctccc 


4560 


aagtagctgg 


aactacaggc 


atgcaccacc 


gcacctggat 


aattttttct 


tttgagacaa 


4620 


ggtttcactc 


tgttgcccag gctggagtgc 


agtggtgcac 


tcttggctca 


ctgcagcctc 


4680 


aacctccctg 


ggcttaggtg 


atcctcacac 


ttcagtctcc 


caagtagctg 


ggactacagg 


4740 


tatgtgccag 


tacacccagc 


taatattttt 


gaaggatggg 


gtttcactat 


attgcccagg 


4800 


ctggtcttga 


actccagggt 


ttaagcaatc 


taccttcctc 


agcctgccaa 


agtgctagga 


4860 


ttataggtat 


gagccacccc 


ccggcctata 


atcctaccac 


tttaaaaaag 


cctgtaattt 


4920 


tagcacttta 


aaaaattttt 
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ccagctactc aggaagctga ggcaggagaa tggcttgcac ctgggaggcg gaggttgcag 

tgagccaaga tcgcgccact gcactccagc ctgagggaga aaagtgaaac tctgtctcat 

aaaagaaaag aaaagcagac agagatctga gacagaagag gagagtgaag gaaaaaaggc 

catgtgaaga tgaggcagag gttggagcca tgcagccaca agccaaggaa tacctggagc 

cccagaagtt gcaagaggta ggaagaagcc tcccctagag cctccagacg gagcacagcc 

ctgccaacac ctccacctca gacttctggc ctccagcact gtgagataat caactgctgt 

tgttttaagc caccagattt gtggtaattt gttatggcag ccacaggaaa ctaatacagt 

acctaatctt cacaaaccca tcttacagaa aaggaaactg aagtcagaga ggtagtggct 

tgtgcagtgt gttaggccat tcttgtatta ctataaagaa atacctgagg ccgggcatgg 

tggctcacgc ctgtaatccc agcactttgg gaggccaagg tgagtggatc acttgaggtc 

aggagttcaa gaccagcctg gacaacatgg tgaaacccca tttctactga aaatatgaaa 

attagccagg catggtggcg tgcatctgta gtcccagcta ctcaggaggc tgaggcagga 

gaatcacttg cgcccgggag gaggaggttg tagtgagcca agattgtgcc actgcactcc 

agcctgggag acaagagaga aaccctgtct caaaataaat aaaaaacaaa taaacacctg 

agactgggta gtttataaag aaaggggtta actggctccc ggttctgcag gctgtacaag 

catggtgccg gcatctgctt ggttgctggg aaggcttcag ggagttttac tcatcgtgga 

aggcagagcc agagcaggtg catcacacag caaaagcagg agcgagagag agagagagca 

gggaggtgtg cacactttta aatgagcaga tctcacgaga actcaccatt gcaaggacag 

caccaagcca cgaggggtct gcccccatga cccaaacctc ccactaggcc ccacccccaa 

cattgggaat tacagttcaa catgaggttt ggggggacaa atatccaaac tatatcattc 

cacccctggc cccccagatc tcatgttctt ctcacattgc aaaatatagt catgccttcc 

cagtagcccc ccaaagtctt aactcatccc agcattaact caaaaatccc attcccaagt 

ccaacgtctc atctgaagat gagttccttt cacctacaag actgtaaaaa tgaaaacagt 

tatttactgc tgagatacaa tgggggcata ggcattaggt aaacattcct gttccaaaag 

ggagaaatcg gtcaaaagaa aggggctata ggccccaagc aagtccaaaa cccagcagag 

caatcattca atcttaaagc tccaaaataa cctccttaaa ctccatgtcc catagccagg 

gcacactggt gcaaggggca ggctcccaag gccttgggca gctctattcc tgcggctttg 
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agctgcctct gctcctcaag gaggaacaga gggagtagga aaggccatcc caggaggccc 
agcaccccat gacctggctc tggggccttg tgggtttatg gattcccagt gctgagtcat 
ccctcacagg ctcttgtggg caccttggac attggtcaga agcatgtggt ccccgggaac 
acaccttttc ctgatcatct gggaagggca gcttgtgcca gcgaggccac ctgttcagcg 
ccacggcccg ccagacagct gcagccacag ccttgccttt gatcagagca aacaccagac 
atgtgtgtca tgcccccaac ccatctccag gggacacatg tcctttcttg ccaggcctga 
gatgaacaag agagggacaa gtccccaagc ctctctctcc ttcctgcctc acccactccg 
ctgttagatt ctcaaggtgg atggtgggct aactagggca accgaccatc ctggtttacc 
tagaactgag ggggcatttt caggaataaa actgcaaaag tctggagcaa acaggagcaa 
gttggtcact ctggggctgg tggagtcagg tttccttctg caggccccct ccccgcaagc 
atgggtggaa cccaggacag gaacacagag caggccccag gaccgggctt gtcacttaca 
agtctttttt tttttttttt ttttgagatg gagtcttgct ctgtcatcag ggctggagta 
cagtggtgcc atcttagctc actgcaacct ctgccttctg ggttcaagtg atccccctgc 
ctcagcctcc tgagtagctg ggactacagg tggcaccacc acgcccagct aattttttgt 
atttctagta gagatgagat ggccaggctg gtcttgaact cctgacctca agtgatctgc 
ccgccttggc ctcccaaagt gctgggatta caggtgtgag ccactgtgcc tggccccact 
cacaagtctt aaaccatgcc tcagcacatc aatgccattt acaaaaaggt agagggattt 
tccaggcaaa aatagatgaa agacatagga tgattgatca tgtcctgctt aaacataggt 
ctgatgctat taagaattga gggctgggag cggtggctca cgcctgtaat cccagcactt 
tgggaggccg aggcgggcgg atcacgaggt caggagatcg agaccatcct ggctaacacg 
gtgaaacccc atctctacta aaaatacaaa aaatggccgc gcgcggtgac tcacgcctgt 
aatcccagca ctttgggagg ccaaggcggg cggatcacga ggtcaggaga tcgagaccat 
cctggctaac acagtgaagc cccgtctcta ctaaaaaata caaaaaaaat tagccaggca 
tggtggccfcjg cgcctgtagt cccagcaact tgggaggctg aggcaggaga agaatggtgt 
gaacctggga ggtggagctt ccagtgagcc gagatcacac cactgcactc cagcctgggc 
gacagagtga aactccatct caaaaaaaaa ataaataaat aaataagaat tgttagtatt 
ttgcaggtgt gacaaatgat tctgtttctg tggcagaatg ttctcaggag atctcttttg 
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tagcttggct 


1512 0 


gtgctgggaa 


ttaagtaaga 


tgaccgggta 


gcaaatatga 


agtattactg 


gacacagagg 


15180 


gccccaggct 


gggttagcag 


cggtggtcag 


ggctgctgct 


tcctggcctg 


agctcgaagg 


15240 


agggccctca 


ttaccacctg ggtgagtcct 


cgtccaagcc 


tggcactgct 


gcgtgggaat 


15300 


aacttctgcc 


acccaagttg 


gcagattgtg 


tgcaaagtta 


agtcctgact 


ctgtggggtg 


15360 


gacttcgagg 


cctcttcatc 


ggacctgctt 


ccggtgactg 


cattcgcacc 


tcctcctgtt 


15420 


cctggtttaa 


cacagcccag 


ctttcctcct 


gctgagccct 


ccctgggcct 


gctgtcaccc 


154 80 


tcgtgccgct 


gtgcctcgca 


gtgccactcc 


ctgtaccctg 


aatactttgc 


cctgcctctc 


15540 


cacccagctg 


agagtcaggg 


cccctgtgag 


gctctgccca 


gcccgtcctc 


cgggtttctg 


15600 


cctctgctga gcacttccct 


gcatgattgc 


ttctgagagt 


ccccccagcc 


tgtgagcttc 


15660 


tcaggactgg 


gacagcttct 


caggaccgag gcttcctggt 


ctgcttgcaa 


ttttacaggc 


15720 
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gggcacattt 


tcccttggcc 


aacatcagag 


actggacatc 


tgcagatctg 


tgctagccac 


15780 


tgagcaccca 


ggcaccccag 


caggtagctc 


tgtaaccaac 


ccattctgta 


aagctgaggc 


15840 


tcagagaggt 


gaagcgcctg 


gcctggggcc 


acagcctgcg 


tcagctgcag 


agccaggagc 


15900 


tgagatatgc 


acctgcggct 


ctgctcacag 


ggtcctgcac 


agactgctgc 


tggagccacc 


15960 


tatgtagagt 


caagagagtt 


catgttaact 


ccctctcaca 


tccctcagcc 


■mJ *J — ' —J S » 


16020 


ctgacgatag 


acactcaggg 


atggcctacc 


ctccccaaca 


acccccgtca 


ggtttgccgg 

ZJ ZJ ZJ ~J ZJ 


16080 


atctccttgg 


aagaaaagtt 


ctgggcagaa 


ttccaccgtt 


ggcctggcct 

ZJ ZJ ZJ ZJ 


acactctcct 


16140 


tagtggctta 


ggaccctcag 


cggtggataa 


gttgtgggca 


gaagagatgc 


aatcaggatt 


16200 


ctcacccact 


caccccttgc 


cagccccaat 


aagctcaata 


agctgggctc 


ggtctgagga 


16260 


agtgtccagg 


aaatgtgcaa 


atggcctggg 


acagccctgt 


gttcctttca 


gtaaggttgc 


16320 


tgaaggtgag 


gctgaaagtt 


ggagaaacag 


aagccagtgc 


ttatggtttt 


aattaagata 


16380 


atggaatgta 


tgtatgtatg 


tatgtatgta 


tgtatgtatt 


tatgtattta 


tctttagaga 


16440 


tagagtctca 


ctctgttgcc 


caggctggaa 
— > — > — > —* 


tgcggtgaca 

ZJ ZJ -ZJ 


caatcatagc 


tccttgcagc 


16500 


ctcgacttcc 


tatgcccaaa 


tgatcctcct 


acctcagcct 


cctgagtagc 


tgggactaca 


• 16560 


gacacacgcc 


aactatgcct 


agctaatttt 


tatttctatt 


ttttgtggag 


actgggttct 


16620 


cactttgttg 


cccaggctgg 


tcttgaaccc 


ctagcttcaa 


gcaatcctcc 


tgcctcagcc 


16680 


tcccaaagtg 


gagqqattac 

~J —JZJZJ 


aggtqtqaqc 

ZJ ZJ ZJ ZD ZJ 


caccacacct 


qqcctggaat 

ZJ ZJ ZJ ZJ 


ttatttgtat 


16740 


tctgcttata 


aaattaatac 


attcttattg 


cagaaaagtt 


tgaaaataaa 


agaaaggaca 


16800 


aagaacaaaa 


agcgtatata 


atttcacagc 


tcagatctca 


ctgctattaa 


catttttatt 


16860 


tactttcagg 


cttttttctt 


tctaggtaca 


tatgcagaga 


ttattttatt 


ttatttattt 


16920 


tattttatat 


tttattttat 


attttttatt 


tcattatttt 


attttatttt 


attttattat 


16980 


ttttagagac 


agggcctcac 


tctgtcaccc 


aggctggagt 

ZJ ZJ ZJ ZJ 


acaatggagt 


gatcatagct 


17040 


cactgcagcc 


tcaaacacct 


gggctcaagc 

ZJ ZJ ZJ ZJ 


aatcccccca 


ctcagccttc 


tgagtagttg 

ZJ ZJ ZJ ZJ 


17100 


ggactaaagt 


gtgagtctgg 


ctaatttttt 


ttactttttg 


tattgacaga 


ggtctcacta 


17160 


tgttgcccag 


gctgatctca 


aactcctggg 


ttcaagcgat 


cctcccacct 


tggactccca 


17220 


aagtgctggg 


attacaggca 


tgagccacca 


tgcctggcct 


aaaatgccac 


tttttgtcat 


17280 


ttactaaaat 


cccatggaca 


ctttgacatg 


tctgtattct 


atgctattga 


tctgactgtt 


17340 
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ggcatctaca 


tcattatggc 


catctatcat 


ctatcataat 


ccattttaac 


attaaaattg 


17400 


tgctgctgct 


tagatttttc 


tggcctgtct 


cctatttgta 


ttcttccaga 


taaattttag 


17460 


aatcatttta 


tcaaattccc 


cttgcagaaa 


aagccctatt 


ggattttggt 


tgaaaaatac 


17520 


tgaattttta 


cattaactta 


ggaaagggct 


gggcacggtg 


gctcacgcct 


gtaatcccta 


17580 


cacttttcga 


ggccaaggca 


ggtggatcac 


ttgaggttgg 


gagtttgaga 


ccagcctggc 


17640 


caacatggtg 


aaactcggtc 


tttactaaaa 


atacaaaaat 


tgccaggcgc 


attggctcac 


17700 


ctgtaatccc 


agcactttgg gaggccgagg 


tgggtggatc 


acgaggtcag gagatagaga 


17760 


ccatcctggc 


taacacggtg 


caaccccgtc 


tctcctaaaa 


atacaaaaaa 


ttagccaggc 


17820 


gtggtggtgg 


gcgcctgtgg 


tctcagctac 


ttaggaggct 


gaggcaggag 


aatggtgtga 


17880 


acccaggagg 


cggagcttgc 


agtgagccaa 


gatcgcgcca 


ctgcactcca 


gcctgggcga 


17940 


cagagtgaga 


ctccatctca 


aaaaaaaata 


ataataataa 


tacaaaaatt 


agccgggggt 


18000 


cgtggcgtgc 


acctataatc 


ccagttactt 


gggaggctga 


ggcaggagaa 


tcgcttgaat 


18060 


ccaggaggtg 


gaggttgcaa 


tgagcagaga 


tcgtgccact 


gtactccagc 


ctgggtgaca 


1812 0 


gagtgacact 


ctgtgaaaaa 


aaaaaaaaaa 


ttctgaagga 


ttgagactct 


tagactctta 


18180 


ggtcttccta 


tccaagagca 


caatatagct 


tttcatgtat 


tcaagccttt 


ttcaatgcat 


18240 


caacagaatt 


ttacagtttt 


tttcatgata 


tcctgctatt 


tcttataaaa 


tgtattccta 


18300 


gatattctgc 


atgttttccg 


gttgtttgtt 


aataaatatt 


tttcatttgt 


cattatttcc 


18360 


taattggctg 


ttatttgtat 


atatgacatc 


tgttgaattt 


tttgattact 


ttgaaaatgg 


18420 


ccattctttt 


gtgttttttt 


ttaactttct 


attttgagat 


aattttgact 


tacagaagat 


18480 


ttgcaaaaat 


agtacagaga 


gttcctgttt 


cccccttatg 


ttaacccagt 


ttctccttat 


18540 


gttaacatct 


tacataacta 


cagaacaatt 


gtcaaatcta 


agaafccaacc 


tgggcacaat 


18600 


gctattaact 


aaactgcaga 


agctgttcag 


atctcaccag 


ttcttctact 


gctccccttt 


18660 


tctcttccag tgttcaatcc 


ggaatcctac 


attatattta 


gttgtcattt 


ctctttggtg 


18720 


tcttccaatc 


tgtgacagtt 


cctcagtctt 


tctttgtctt 


tcatgacttt 


cattttttta 


18780 


tacttttgaa 


aaatactggc 


cggttgtttt 


gtagaacgcc 


ctcagtttgg gtttgcctga 


18840 


agttttttgt 


gattagatcg aggtcatgca 


ttattggaga 


gggtgccacc 


gcctcgatgt 


18900 


gcaagctcaa 


tgcatcatat 


cagagggttt 


gtaatgtcag 


tttataccgc 


cggagaccct 


18960 
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aacctggagc 


atttcgtgaa 


ggtgctgtct 


gccaggattc 


tccactagaa 


agttactatt 


19020 


tttccctttt 


taattactga 


atgtctgagg 


ggaaatactt 


tgagactatg 


caaatatcct 


19080 


gtttctgctt 


taacttcggc 


tcactaagtt 


tagcattcat 


ctatggatct 


cgcttatagc 


i9140 


aagtattact 


gtggagttct 


aatggtaatt 


ttctgtttct 


ctcattcctt 


caacctttat 


19200 


taatatgctt 


cttcctcact 


tattcatttt 


gtttcagttg 


tttataccaa 


catggatttg 


19260 


tggatattgg 


ttttattctt 


tgggttgcaa 


ttgaatccta 


tcattatttt 


gttagtcagt 


19320 


tgttccatcc 


gaccttggtc 


attaggagcc 


cttgaaattt 


ggctcccatg 


cctttttttt 


19380 


tttttttgag 


accgagtctc 


actctgtcac 


ccaggtttga 


gtgcagtggc 


atgatcttgg 


19440 


cttcctgcaa 


cctccgcctc 


ccaggttcaa 


gcaattctcc 


tgcctcagcc 


tcctgagtag 


19500 


ctggtattat 


aggcgctcca 


ccaccttgcc 


cggctaattt 


tttgtatttt 


tagtagagat 


19560 


ggggttttat 


tatgttggcc 


aggctggtct 


caaactcctg 


acctcaggtg 


atctgcccgc 


19620 


ctcggcctcc 


caaagtgctg 


ggactacagg 


cgtgagccac 


cacacctggc 


ctcctatgcc 


19680 


attttaacat 


gcccgtcttt 


tctttttctt 


tcctactttc 


tgtgactgta 


agaagctcca 


19740 


ggatacattt 


ttgctgccct 


agacttagcc 


tcaatcagtt 


ctcagaaaag 


ctctggttct 


19800 


ttttatggga 


tacttagaaa 


actagctctg 


tatggcctgg 


cgcggtggct 


cacgcctgta 


19860 


atcccagtac 


tttgggaggc 


cgaggtgggc 


agatcacaga 


tcacgaagtc 


aggagatcaa 


19920 


gaccatcctg 


gctaacatgg 


tgaaactctg 


tctctactaa 


acatacaaaa 


aattagtcca 


19980 


ggcgcggtgg 


cgggcgcctg 


tagtcccagc 


tactcaggag 


gctgaggcag 


gagaacggca 


20040 


tgaacccggg 


aggcggagct 


tgcagtgagc 


cgagatcggc 


agccactgca 


ctccagcctg 


20100 


ggccacagag 


cgagactccg 


tctcaaaaaa 


aaaaaaagga 


aaaagaaaaa 


agaaaactag 


20160 


ctctgtatgc 


tagttttttt 


tttaagacag 


ggtctctctt 


gccccagctg 


gagtgtagca 


20220 


gcacgatcac 


agctcactgt 


agcctcaacc 


ttctgggctc 


aagcaatcct 


cctgcctcag 


20280 


tctcctaagt 


agctgggtct 


acaggcatgc 


accaccgtac 


gtggcaattt 


ttaaaaactg 


20340 


tttgtagaga 


tggagtctcc 


ctatgttgcc 


tggtctggaa 


ctcctggcct 


caagtgatcc 


20400 


tcctgcctcg 


gcctcccaaa 


gtgctgagat 


tacaggcatg 


agccactgta 


cctggcctgg 


20460 


ccaaggtctg 


tcttttttta 


aaagaagttg 


ttgtatagtt 


gttttttttt 


ttattttttt 


20520 


ttctgagacg 


gagtctcgct 


ctgtcgccca 


ggctggagtg 


cagtggtgcg 


atctcggctc 


20580 
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actgcaagct ccgcctccca ggttcacgcc attctcctgc ctcagcctcc cgagtagctg 
ggcctacagg cgcccgctac cacgcccggc taattttttg catttttagt agagacgggg 
tttcaccgtg ttagccagga tggtctcgat ctcctgacct cgtgatccgc ccgcctcggc 
ctcccaaagt gctgggatta caggcgtgag ccaccgcgcc cggcctgttg tatagttttt 
atctcgagtt ttctagcgat ttaatcatat tggttacaaa aaaggatgat tttactacct 
cctttccaat gtttctacat attttttcat tttatctaac tgcattttaa aataaacttt 
taattttaga atggtttcat atttacagaa aatgtgcaaa gatagtacag agagttcctg 
tgtactccac acccggtttc cttattatta tcttaacgtg atacacaatt aataaaccag 
taacattatt attcactgaa gtccacactt tctttttttt tttttctgag acggagtcta 
cttctgtcac ccaggctgga gtgcagtggc gcaatctcgg ctcactgcaa cctccacctc 
ctgggttcag gcaattctgt ggctcagcat cccaagtagc tgggaataca ggtgcccgcc 
accacgcccg gctaattttt tgtattttta gtagagatgg ggtttcacca tgttagccag 
gatggtcttg aactcctgac ctcgtgatct gcctgcctca gcctcccaaa gtgctgggat 
tacaggcgtg agccaccgcg cccggcgtcc atactttctt tagatatcct tcctttttac 
ctaacgtcct tcttctggtt caggatccca tccagaaagc aacattaccc ctcgccatca 
cgtcttcaca ggctcccctt gacgggaaga gttcctcaga ctttccttgt ttttgttgac 
cttgacagtt ttgaggagga ctggtatctt agtctgtttt gtgctgctat cacagactag 
ctgagaccga tacatgatac atgaaaaaaa atgtattctt acagttgtgg aggctgggaa 
gttcaagacg aagttgctgg ttggtttggt ctctggtttc aagatggcgc cttgctgctg 
catcctctgg agaagaagaa tgcggtgtcc tctcactgca gaagatggaa gcgctaaaag 
gaatgaactc cctttgccaa gccattttat aatgggcatt aatccacaaa ggatgaaacc 
ctgagaaaca tcaagcttta aagcactggt tctcaacctt tttggtctca ggagcccttt 
atactcttaa aacgttttga ggatcccaaa aaaaggcttc tacaggttcc atcttttaat 
atttaccata tcaaaaatta aactgaaaaa attttaaatt atttattcat ttaaaataac 
aaggataaac ccattacatg ctaacataaa tcatgtattt tatgaaaaat agctatattt 
atcaaaacaa aaattagtga gaagagtggc atgtataatt ttttttgttt attttttgtt 
tttagatgga atcttattct gtcgcccagg ctggagtgca gtggtgtgat ctcggctcac 



20640 
20700 
20760 
20820 
20880 
20940 
21000 
21060 
21120 
21180 
21240 
21300 
21360 
21420 
21480 
21540 
21600 
21660 
21720 
21780 
21840 
21900 
21960 
22020 
22080 
22140 
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tgcaagctct gcctcccagg ttcacaccat tctcctgcct cagcctcctg agtagctggg 

actgcaggtg cctgccacca cgcccggcta attttttgta tttttagtag agatggagtt 

tcaccgtgtt agccaggatg gtcttgatct cctgaccttg tgatccaccc gcctcagcct 

cccaaagtgc tgggattaca ggcttgagcc actgcgtctg gcctaaattt ttgtgaatgt 

ctttaatgcc tgccttctca tatttgtttc tgcattcaag ttattgcaaa atgttgtgtt 

ggttgaagtt tgtaaagaaa atgtggcctc atacagttgt gtagttggaa aggcaagagt 

attttgattc tctcttcaaa caactatgga caacctgctg ttacaaaacc agaatgcaaa 

aagttgtagt aaatacaggt taggtgtagt gtggaatctg aaagcatgtg aatgaacttt 

ctgagttttg taacattaaa gtccagttgc gttaagctac tgtgatagca tatagcattg 

tcctaatact ggaattagta tcagaagtgg ggtgctactg ttaataaata aaaagaaata 

aataaatcat gtgatactgg ctcagaagtc aggcagtagg ctgtgtggaa cctgacatca 

cgccatgtaa tacattggca accatttgat ccagctgtct gtcatgatga cttggaaagt 

caaccacata cttacagagc ctgtagacat aggggaaaat agtataaaac agaatactaa 

cagtggacct tggttcttgc cagttgcatt tagccaaata ttaaacaaaa gagatattct 

tgggcagcaa ctggaccatc ttcaagtaaa agtgaaaggt aataaacaga gtccagacat 

ttgtgcccat gcgggttaag aaaaatccag ttgcttctag acaccgtata tgaaaacaac 

gctgaaaaca agcctttgag tggtaaaggc cgattaacac tcagcgcggt aacaaagacc 

aggtgggcta acccgaaatg aaatgagaag cctgtggtga tgaggaggca gagaagtaaa 

atcaagtttg agcatttcgt ttaggagagt ttgggctctg attacttgca catgcaaacg 

aactggaaac aaacagatca gatgtctacc acttcttcga gggaattgca ttgccaaaga 

agtcatgaaa gcagactcta tactgattag gcattaaaac aaaaacaatc tttaggcccc 

taaacttgca tgggcaggaa gtgggctgtc aaagctgttc atcctctaag gtggacctag 

ttcctagtcc ccagtataca cttcagatgt ggccctggag gacactggac atggaggacc 

tcccagagga tgaggctagg gcttcatttc tccaatgacc tcagctgcct ctatttcccc 

ttcttcctct ggaagtccta tcatcgttat tattattatt atcatcattt ttattttgag 

ataaggtctc gctctgttgc ccaggctgga gtgcagtgac atgatcatgg ctcactgcag 

ccctcccagg ctcaagtgat cctcctgcct cagcctcctg agtagctggg agtacaggca 
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22440 
22500 
22560 
22620 
22680 
22740 
22800 
22860 
22920 
22980 
23040 
23100 
23160 
23220 
23280 
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catgccacca 


tgcttggcta 


tttttttttt 


cagtagagat 


agggctctca 


ctatgtfcgcc 


23880 


agggctgatc 


tcaacctcct 


gggttcaaga 


gatcctccta 


cctcagctcc 


tgagtagctg 


23940 


ggattcgggt 


gcacaccacc 


atgccaacta 


atttttaatt 


tttttttgta 


tqgacacrqat 

ZJ J ZJ ZJ 


24000 


gtacagtgtt 


agaaatggat 


tgcttgcaga 


ggcaggagga 


tcacttgagc 


ccaggagttt 


24060 


gatcacactg 


tgaaccatga 


tcgcacccct 


gcactccaat 


ctgggcaaca 


qaqtqaqacc 


24120 


ttgtctcaaa 


aaaaaaaaaa 


aagagagaga 


gagagagact 


caaagatagg 


caaaaaagtg 


24180 


ggaaagcttt 


atagtggaca 


aaaaggaacg 


ctctaagtct 


gccctattgg 


catqqtqctq 

ZJ ZJ ^ ZJ ZJ 


24240 


aaggtgggct 


aactagagat 


agggggtact 


atgtggttga 


ctatgggtgc 


atctttggct 


24300 


ttccctgggt 


gatcctaagt 


tggaagcagg 


gacaaaaatt 


aqqqaaqctq 

ZJ ZJ ZJ ZJ ZJ 


ttagttattc 


24360 


atcacgttct 


ggcagtagtg 


gactggttgt 


gatagaagtt 


attgttttgg 


ccaggtgcgg 


24420 


tggctcatgc 


ctgtaatcct 


agccctttca 


gagttcaacg 


tqQQtqqatc 


acfaaaqcraqq 


24480 


gaggatttgg 


gaggtcagga 


gttagcctgg 


ctaacctggc 


gaaatcccat 


ctctactaaa 


24540 


aatacaaaaa 


ttagctgggc 


gtggtggtgc 


atgcctataa 


tcccagctac 


tcgggacgct 


24600 


gaggcaggag 


aatcagttga 


acctggggag 


gcggaggttg 

J ZJ ZJ ZJ ZJ ^ ^ ZJ 


cagtgagcca 


agatcgtgcc 


24660 


caatttcatc 


tcaaaaaaaa 


aaaaaaagtt 


atcgtttagc 


ttcctcgatt 


gttactggac 


24720 


gtagtaatct 


ggcttcctgc 


aagtctaact 


ttcagcagac 


tggctacatg ggctgtgtac 


24780 


tgtagataag 


gcagtaagta 


aagcaaaaat 


tgatagagca 


tcaaggataa 


atagaaaatc 


24840 


cgtaatcaag 


cagaagattt 


gaacacttca 


ctttcagtaa 


ctgataaaac 


aagtagacaa 


24900 


aaaaaatcag 


taaggatgta 


gaagatttga 


acaacgtaat 


taacaaactt 


gacttgattt 


24960 


acacgtctag 


aaccctgcag 


aacacacact 


ttttcaagca 


tactcagaac 


atttatataa 


25020 


agtgaccata 


tggtggacca 

ZJ ZJ ZJ ZJ 


taaagcagtt 


tcaacaaatc 


tcacaggagt 


aaaataacag 


25080 


accgtgtttt 


ctgaccgtaa 


gtacagttaa 


cctagaaatt 


gaaaacaaaa 


agctagaaaa 


25140 


accccatgta 


tctggaaatt 


ttaatataca 


cttfcgaaata 


acaaatggat 


cagagattaa 


25200 


ttcaaatagg 


aatttagaaa 


taccttgaac 


tgaaaaataa 


tgagaatact 


ataccccaaa 


25260 


actgtggggt 


gcagctgaac 


agtatataga 


cgaaaagtat 


actcatatgt 


gcatacctta 


25320 


aggagcgggg 


aggattgaaa 


gttaatggga 


ggcaaaagca 


ggtggatcac 


ttgaggttag 


25380 


gagttcaaga 


tcagcctggc 


taacagggtg 


aaaccccatc 


tctactaaaa 


atacaaaaaa 


25440 
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ttatccaggc 


gtagtgaggc 


tgaggcaaga 


gaatcgttgg 


aacccaggag 


gcagaggttg 


25500 


cagtgagccg 


cgattgcgcc 


actgcacccc 


agcctgggag 


acagagcgag 


actccatctc 


25560 


aagaaagaaa 


aaaaaaaaag 


aaaaggccag 


gcgcggtggc 


tcatgcctgt 


aatcccagca 


25620 


ttttgggagg 


ccgaggtggg 


cggatcacga 


ggtcaggaga 


tcgagactat 


cctggctagc 


25680 


acggtgaaac 


cccgcctcta 


ctaaaaatac 


aaaaaaatta 


gccaggcgtg 


gtggcgggtg 


25740 


cctgtagtcc 


cagctactca 


ggaggctgag 


gcaggagaat 


gtcatgaacc 


caggaggcag 

ZJ ZJ ZJ ZJ ZJ 


25800 


agcttgcagt 


gagccgagat 


cgcgccactg 


tactccagcc 


tgggcaacag 
_ > — * j zj 


agagagactc 


25860 


tgtctcaaaa 


aaaaaaaaaa 


gttaatggga 


taaacatcca 


tctcaagaag 


ttagaaagga 


25920 


atgacaaata 


aaccaaaaaa 


aaaaaaatca 


aaagaagaaa 


atcataaggt 


caagactata 


25980 


aagagagtgg 


ctgggtgcag 


tggctcaggc 


ctgtaatctc 


agcattttgg 


gaagcagagg 


26040 


tgggcagatc 


acttgagccc 


aggagttcaa 


gaccagcctg 


agtaacatag 


agagacctca 


26100 


tctttgctga 


aaataaaaat 


aaaaaattag 


ccaggcatgg 


tggtactgag 


gtgggaggat 


26160 


cacttgagcc 


taggaggttg 


aggctgcagt 


aagccatgat 


tgtgccactg 


cacttcagcc 


26220 


tgggtgacag 


agtgggaccc 


tgtctctaaa 


aaactaaaat 


aaggctgggc 


gcggtggctc 

ZJ ZJ ZJ ZJ ZJ 


26280 


aaatctgtaa 


tcccaccact 


ttgggaggcc 


aaggctgagg 


tcagcagttt 


gagaacagct 


26340 


tggccaacaa 


gatgaaacct 


catctctact 


aaaaatacaa 


aaaattagtt 


gggtgtggtg 

ZJ ZJ J J mJ -J ZJ 


26400 


gcatgtgcct 


gtaatcccag 


ctacttagga 


ggnnnnctnt 


ngattatatt 


ttctccttcc 


26460 


tacgtcgtta 


ttggactgaa 


ttcagaatga 


tgactctcat 


tggagctctt 


cctgtctcct 


26520 


aactacagtg 


gcttccgacc 


ccactctggt 


tttcacttca 


cccctctgct 


gctcatacga 

3 •ZJ 


26580 


gtagatactt 


ccttccttct 


ttctcacttg 


ttgctcttcc 


tcaacccccc 


ccgttggtgt 


26640 


cccctcctct 


ttatcttttt 


ctcgcgacac 


ctgcgttctc 


ttgccctctt 


atcatccctt 


26700 


tctcgaggcg 


gtcctttcct 


ttatccagct 


taaatacctt 


ctcctctgtt 


tatttggggg 


26760 


ttgggttttt 


atctctcacc 


ctccctctaa 


tttctttcct 


ctttccgcac 


ccatcaagcc 


26820 


tctcgtggtt 


tctcttcctc 


tactctcggg 


tcccccccct 


ctccccttct 


ttttttcttc 


26880 


acccccccaa 


gcgctttgcc 


ttttttttct 


ttgcccttta 


ttcccccc 




26928 



<210> 7 
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<211> 29430 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> unsure 

<222> (4336) , (4345) , (4349) , (4392) , (4447) , (4490) 

<223> Identity of nucleotide sequences at the above locations are unknown. 



<400> 7 



aggggaaggg 


ccggctccgt 


aactcacacr 






aggagagagg 


o u 


atcatctcag gccaggagtt 


caagaccagc 


c t aaaczi a fa 


c cty \— c*. ay ct l> o 


y v_ ct L- 1- 1_ O l_ 


ion 


aaaaacttct 


tttaaagctt 


uaaaacmcicici 


daadday ud.u 


a-9 «■ g 3"- c cty u 


l. u cigy cty cicia 


xou 


agcctgtaga 


ggcagcacac 


taaggaggag 


acgcagccca 


ggcaccagga 


ggggctggcc 


240 


atgggcactc 


actcctccag 


caggcgagtg 


cccagcacca 


gctggcccac 


ccagacaccc 


300 


aggacacggc 


ctgaatggct 


ccgtattcac 


gtgggtggta 


ataaacaagc 


aatacacata 


360 


gccaataagg 


acaccttagt 


aatgttacat 


cataaacgct 


gcagatcagg 


gaaatggtgc 


420 


agggtgaagt 


gggttggggg 


gctgcatgct 


acatgagaag 


tgggtcgggg 


ggctgcatgc 


480 


tacctgagac 


agagcaggcc 


ttgctgggaa 


agaaggagcc 


ggcaggcctg 


ggcaaaggtc 


540 


ctggggtggg 


agcacactgg 


agcagagtgt 


gggggtagca 


tggcgggtgc 


tggtcctctg 


600 


ggcgccttcc 


caccacgtca 


tgtgcccatg 


tgcccaaggt 


ctctcgtttc 


acagccccct 


660 


gaagctcagg 


ggtcacagct 


acacagcccc 


cagatacctt 


ggcctgcccc 


aggtcattcc 


720 


atccagtgat 


ggacctgctg 


acctctagcc 


tgacctctgg 


gcagcgtaat 


ttgagaagga 


780 


ggagaaggga 


gggcaacaga 


cctggggcga 


tgagggatgc 


acagggtggc 


agacacctga 


840 


ggctgcacct 


tggagcctca 


gttctgggtg 


tgggtggggg 


atggacaggc 


tgagggctga 


900 


agcagctggg 


cccggccacc 


atcacacccc 


aggacccacc 


agatcaccat 


gaaaaaccga 


960 


atgtcaactg gcagcccaga 


gtgcagaaca 


aacctttcag 


aaacacggtg 


gtgactgccg 


1020 
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catcatgaac 


ataaaataat 


tacgccctct 


ccccagggat 


cacccctgca 


ggagtttgtc 

Zj zj zj ^ 


1080 


ccaagaaaca 


ccagaaagaa 


ggaaaacgtc 


tgagtcacaa 


tatttgetga 


ggccttattt 


1140 


gtaatagcaa 


aaaaaaaaaa 


aaaaaaagaa 


caatctccag 


eggcaggggt 

— ' ~J — * —1 ~J ZJ 


aactagacta 


1200 


ttgtctccgt 


ggaaaggtag 


caccaattaa 


ctagtaacaa 


aatgactgcg 


gtaacaacaa 


1260 


aacgttcgac 


atgtcaacac 


caaaaaccac 


acacccagca 


taaccgtgaa 


ccatgatttc 


1320 


tactagaatg 


aatggcagtt 


atgagaaagc 


accageggag 


acaaagattg 


aaaaagtaaa 


1380 


ggtggcctca 


ttagggagac 


aagtctctgg 


gtaatatatt 


gtaatactgg 


taaatatata 


1440 


gtttttaata 


tattttttaa 


ttccaaattc 


catatatgtt 


cctatgaagc 


tatttctgea 


1500 


aatatttttt 


tcaggaccgt 


acatcacaaa 


ggcaaaaggg 


ccaggtcagc 


tctccagctg 


1560 


agagtgacca 


cttcagagca 


gacggcagac 


tccagggtta 


qcaaqcctqq 

ZJ Zj *~ZJZD 


ctgagacctg 


1620 


gcccatgaca 


atcactcaac 


ccctctgacc 


tcaacatcct 


gtctgtgaaa 


tggggataat 


1680 


tactgcacct 


ccacatcaca 


qaqtqcqagg 

*~Zj)^~zj ^^ 


cttaaacagg 


atgettcata 


gaaaageget 


1740 


caagaggtaa 


cagccgggag 


ggggtagtgg 

ZJ ZJ ZJ ZJ ZD ZJ ZJ 


ttttcattaa 


ttaaatgttg 


ccttcatcca 


1800 


gccctgggcc 


agctccaaca 


caaagcacac 


accatccact 


cagactcagt 


tgcctggatt 


1860 


caaagcccgg 


cctggcctcc 


age t gt gaga 


ttccgggcag 


gatttcccat 


ctcccagagc 


1920 


ctcagtttcc 


tcattcatga 


aacaggaagt 


gatcattcct 


tttattttta 


tttttatttt 


1980 


tattttgaga 


cggagtttca 


ctctagttgc 


Ccaqqctqqa 

ZJ ZJ ZD Zj 


gtatqatqqc 

ZJ Zj ZJ Z3 


gcaatctcag 


2040 


ctcactgcaa 


cctcggcctc 


ccagtttcaa 


gcgattctcc 


cacctcagtc 


tcctgagtag 


2100 


ctgggattac 


aggcacacgc 


caccacgccc 


agctaatttt 


gtatttttag 


tagagaeggg 


2160 


gttttgccat 


gttggtcagg 


ctggtctcga 


actcctgacc 


tcaggtgatc 


cgcccgcctt 


2220 


ggcatcccaa 


aqtqctqqqa 

Zj Zj ZJ ZJ ZD 


ttacaggtgt 


gagccaccaa 


gcccagttga 


caactgettt 


2280 


taaagacacc 


tctggctgct 


gtggaaaaca 


qcctqqtaqt 

ZJ ZjZJ -3 


gectcaaaaa 


gttacacata 


2340 


gaatgatcct 


atgaccagta 


attccactcc 


tacatatata 


cccaaaagaa 


ctgaacccct 


2400 


ctactcatgt 


atgtacacat 


acaggtacac 


gcatgttaac 


agcagtgttc 


acaaagecaa 


2460 


aacatggaaa 


cagctcaaat 


gtccataacc 


gatgaaegga 


taaatgaaac 


gtagtctatt 


2520 


caccacctga 


cggaggtgag 


aggggecata 


aaaaggaatg 


atgcataaaa 


acgaatatta 


2580 


tggccaggta 


tggtggctca 


cgcctgtaat 


cccaggactt 


tgggaggctg 


aggegggegg. 


2640 
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atcacgaggt 


aaggagttcg 


agaccagcct 


ggccaacacg 


gtgaaacccc 


atctctacta 


2700 


aaaatacaca 


aattagctgg 


gcatggtgga 


qqqcqcctqt 


aataccagct 


actccggagg 


2760 


ctgaggcaag 


agaatccctt 


gaacctggga 


aacagaggtt 


gcaqtqaqct 

^* zj **~zy 


gagattgcac 


2820 


cactgcactc 


cagcctgggc 


gacagaccaa 


aactccgttt 


cggaaaaaaa 


agaaaaaatt 


2880 


agccaggtgt 


ggtggcgggt 


gggtccctgt 


aatcccagct 


ctacttggga 


tactgaggca 


2940 


ggagaaccac 


ttgaacccgg 


gaggtggagg 


taqcqqtqaq 


ctqaaattat 


QCCactQCQC 


3000 


tccagcctgt 


gtgacagaag gagactctgt 


ctctaaaaaa 


caaaaacaaa 


aaaggcccga 


3060 


cgcggtgtct 


tacacctgta 


atgccaacac 


tttqqqaaqc 

Zj Z3 3 ZJ 


caaggcaggc 


agatcatctg 


3120 


aggtcaggag 


tttgagagca 


gcctgggcaa 


cacggtgaaa 


ccccatctct 


actaaaaata 


3180 


cagaaattag 


ccaggtgtgg 


tggcacatgc 


ctgtaatccc 


agctactcgg gaggctgagg 


3240 


caggagaatc 


gcttgaaccc 


aggaagcgga 


ggttgcagtg 


agccgacatt 


gcaccattat 


3300 


actccagcct 


gggtgacaga 


gtgagattct 


gtctcaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


3360 


ctaaacaaaa 


gcaaaaaaac 


caatgagtaa 


tattqtcaaq 


tgaacttcat 


cccaatggga 


3420 


atgcagataa 


tttgtttaaa 


aggcaccatg 


cacactgggc 






3480 


cgtcttcttt 


tgcctggatt 


cccagttggt 


ttaatcgggc 


gtagaacact 


ttcttcaatc 


3540 


cgggattcag 


gcacccctgc 


tcagcacaaa 


ctcagtacac 


cccgcactct 


gctgtgggtt 


3600 


cttggcacta 


ttaggagaat 


atcfacrcfcrcrcrt 


gattcagatc 


tatctctagt 


gggtgcatgt 


3660 


ctgccactcc 


caggaacgcc 


cacttctggc 


aagtcagtgt 


cagagaaagg 


ccagctcgtg 


3720 


gcccctcctg 


ccttgagtcc 


caggacccgt 


gatcagtcct 


acccggagca 


gaatcaggag 


3780 


tttgaaaacc 


caagtgccaa 


caatctcatt 


ttaacccatg 


taagcatatc 


caatatttat 


3840 


atatagaatt 


cataacagat 


gtctgggctt 


ccattccaat 


agcctatatt 


ttacactgtt 


3900 


tatttacatg gttacaccaa 


acaagactca 


attcaaggta 


acccaatcct 


ttgctactat 


3960 


accaaaataa 


gcaacatttt 


cagtccatgc 


cttatatata 


ttcaccaagc 


attacactag 


4020 


gcctccaact 


gctcatcgga 


gcaagctgca 


gcctggacac 


aagctagaga 


ttaatcagtc 


4080 


aggaatgatc 


ctgcgtccag 


tgccagcatg 


atggaagaga 


cagagaaaca 


gaagacatca 


4140 


gggctccaga 


gtcaaggagc 


ctgcaggtta 


gttgggcagg 


atatacacac 


atacacacac 


4200 


acacgcacac 


acaaaaccac 


ccaagaagaa 


aaggtgggat 


gaatgcatgg 


acaggtaatg 


4260 
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cctggagcct 


ggggatggat 


aagctgactg 


caggtggccc 


aggcaggctt 


cctggaggaa 


4320 


gaagacctgg 


ctgtangtgg 


ggtangcang 


ctttctaaat 


ggggaaaatc 


tggctgtggg 


4380 


tggagttggc 


angtttccga 


aaagaagaaa 


agctgactat 


gggtacacct 


ggctgttggt 


4440 


ggaacangca 


ggcttcttgg 


aagaagaaaa 


tctggctgtg 


ggtggatcan 


gcaagcttct 


4500 


tggaagaagt 


aaacctgact 


atgggtggac 


caggcaggct 


tcctagagga 


agaagaccgg 


4560 


ctgtgggtga 


accaggcagg 


cttcctagac 


agaggaagat 


ctggctgcgg 


ttagagtggg 


4620 


caggcttcta 


agaagaggaa 


gggctgactg 


tgggtagacc 


tggctgtggg 


tagactgggc 


4680 


aggcttcctg 


gaggaggaag 


agctggagca 


ttgaaaaaca 


aacatgactt 


ggtgaatgtt 


4740 


gagcatgccc 


aggcctgatc 


cccagaggca 


attacgcact 


caagttactt 


aattctactc 


4800 


acaatgcctc 


acaaacaact 


tctctgacac 


ctaacacagc 


tctgggcacc 


ttctagcttc 


4860 


agctcctcaa 


agcagttatt 


cacgctacta 


ccctgcacac 


ctcctcacac 


cccaacccca 


4920 


gggacaggag 


ttctgccaga 


tgccaaagct 


cctgatgcca 


aagcctgggt 


ctgcttccgg 


4980 


gctcctcttg 


gtctaactgt 


ccaccccgca 


tcggcatgat 


gtgcaaaaac 


aaggctttgc 


5040 


aatctgccct 


gatgcctggc 


ggagcgagtc 


cctcccgatt 


cgtctccttc 


agaaacacct 


5100 


gggctgccct 


ggtcctgtta 


tacccccaac 


acattctaca 


gtcagctccg 


caagttccac 


5160 


aaagatcaac 


gctggcgttt 


ttatggcatt 


ttatttacag 


tttttacaat 


ataaaaaagg 


5220 


aaggatgcca 


cagctcagcc 


agcaggacag 


acagagatct 


atgafcgcttc 


tgctgcacca 


5280 


ttgtttgtgg 


tcaagaaagt 


ctgttttcaa 


tgatttatta 


aattgtggtg 


ggagatggat 


5340 


ggtggcagtg 


gttaccagca 


acatgaatgt 


tcttaatgcc 


actgaacttc 


acacttacaa 


5400 


atggttacga 


cgataagtgt 


tatatgtatt 


ttaccacaat 


taaaaacagg 


taaatgcagg 


5460 


ccgggcacgg 


tggctcacga 


ctgtaatctc 


agcactttgg 


gaggccaagg 


caggcagatc 


5520 


acctgaggtc 


aggggttcga 


gaccagtctc 


gccaacacgg 


tgaaactctg 


tctctattaa 


5580 


aaatacaaaa 


attagccaga 


tgtggtggtg 


catgcctgta 


atcccagctt 


ctcaggaggc 


5640 


tgaggcagga 


aaatagcttg 


aaaccgggag 


gcagaggttg 


ccatgagctg 


agattgtacc 


5700 


attgcactcc 


agcctgggtg 


acaaaagcaa 


aactctgtct 


caaaaaaata 


aaataaaata 


5760 


aaaataggta 


aatgcaaaca 


tatggtatag 


taatattatg 


ggctattatg 


agctacaaaa 


5820 


aagaatgact 


tgggactaca 


gttacagccc 


tcattcagga 


atttgtttta 


aatgtgggtt 


5880 
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ggtcgctaag gcatgtacac aacattttga cgttcaaata ttcctagatt tggacagtga 
gcacccctct aagctggctc ttctgtccca gaggtcccca ccagtcctcc agaacttctt 
tgctttctta cacaataaga tgccccatgc tcggcttgta cctttccttg ccccagccct 
agaaccagct tcttcgtgga caagctctga ctcctttggg tggagaatgg tattcagaaa 
cccagacctg ggctctggtg tgctcactgc tacttggggt cattgcttct aggcctctct 
gctgatggag gtaggatata cacgtacagt cttccctctt cccagattcc gtacttgagc 
tcgcctactt gctaacattt atttatatcc cccaaattaa acctcacagc acttctgcaa 
tcactcactg acttgcagag tgtgaaaaaa ctgagtcacc atcacacgtt ccaaactgag 
gtcaactgag gccacaacgc cccatcttct tgctccggct gtcgagatgt aagcaagtgt 
ccttctctcg gtctagctag tgccatgctt tccacatcac tgtgcttttt gtgggcaatt 
ttgctgtata aaatgtcccc tgcacatatg ctgctgtgta gtgctcctag gtgcatgagg 
ctgccccacg ccttacagag agaatatgca tgagaggctt tattcaggta tgagttatag 
cgtagttggc catgaattca atgttaatga atcaacaata tacagtaaat aaggtgcttt 
ttagagacag ggtctcactc tgtcacccag gctttagagt ccagtggtgt gaccttggct 
cactgccgcc tcaacctcct gggctcaagt gatcctccca cctcagcctc ccaaactgtt 
gggattacag gcgtgagcta ctgcactcag cctaaataag gtgtcttaga aacacacata 
agacaaggtt atgggctgag tgcggtggct catgcctgta atcccaacac tttgggaggc 
caaggtggga ggttcacttg aggccagaag tttgagacta gcctgggcaa catggcaaga 
cctcatctgt atattttttt aaatcagaca ggtgtggtgg tgcatgccta tagtcccagc 
tactggagag gctgaggcag gaaaatggcc tgagcccagg aggtcaaggc tgcagtgacc 
catgattgta ccactgcatt ccagcctggg gtgacacagc aagacgctgt cttaaaaaaa 
aaaaaaaaaa aagccaggtc aggtatcgaa cagttggcaa aaacgttgtg acctgaggct 
cacaggaacc tagcccgatg tttcccctag gagcaatggt tcagtattca ataattcagg 
gttcccagtg actttatgga gcataacttt caagaataac aagaaccaac tgtacgtgtg 
tatgtatact cacactttta ttttatttta ttttattttt tgagacagag tctcactctg 
tcacccaggc tggagtaaaa tggcgtgatc tcgactcact gcaacctccg cctcccaggt 
tcaagtgatt ctcagcctcc caagtagctg ggattacagg tgtgccccca caaccggcta 



5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 
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afcttctgtat 


ttttagtaga 


qacqgagttt 

ZJ ZD ZD ZJ 


cgccacattg 


qccacqctqg 


tctcaaactc 


7560 


ctaacctcaa 


gtgatccacc 


cacctcagcc 


tcccaaagtg 


ctggaattac 


aqqcatqaqc 


7620 


tgccgtgcct 


agcctacata 


cacttttata 


cacacatgca 


tctatgacta 


tttctctatt 


7680 


tctgtgcatg 


tgtgcgtggc 


agtacctaca 


gtttcagcta 


tgtgtctggg 


tactgtctcg 


7740 


tccaagtttg 


taagcacctt 


ctccaaagtg 


caaagcctgg 


cttgtgttac 


tatccatatg 


7800 


tttacttatt 


tgctcaatca 


atttacttat 


tagctccata 


accagcttcc 


catctgctcc 


7860 


agtagcctct 


gcfcgfccagtc 


acctctgcac 


cctaccccac 


cttgcttccg 


qafcqctqqat 


7920 


gccaatcacc 


cccgacacct 


ctacatagca 


ccaccctcga 


catgctgctt 


ctttatttct 


7980 


tatttatttg 


tttgagatgg 


agtcttactc 


tgttgcccag 


gctggagtgc 


agtggcacga 


8040 


tccaggctca 


ctgcaacgtc 


cgcctcctgg 


gttcaagtga 


ttctcctgcc 


tcagcttctc 


8100 


aaatagctgg 


gattacaggt 


gcccaccacc 


acgcccagct 


aatttttgta 


tttttagtag 


8160 


aoratQQQQtt 


tcaccatgtt 


qqccaqqctq 

-3-3 -3-3 ZD 


gtctcgaact 


cctgacctca 


agtgatccac 


8220 


ctfcggcctct 


caaagfcgctg 


ggattacagg 


tcrtaaqccac 


cqcqcctqqt 


ctgcttcttt 


8280 


aaatgccagg 


caccaacatt 


tgtgcaatgg 


qqfc qqqaqqa 

-3 -3 -3 ZJ -3 -3 _3 


aagaacaggg 


aggagagcac 


8340 


actgccggcc 


cctgcactga 


atccactgat 


caatctqqqq 


gcaactgcca 


tctccatctc 


8400 


ctgtcttcct 


atccgtgaac 


atctactgca 


gtcctctcca 


atgtccttct 


gtaaagttgt 


8460 


attatgtttt 


Qcatacacrcrc 


cttgcatatt 


agttcfccaga 


tataatccat 


atactttata 


8520 


taaaattcaa 


accacattta 


aaaaaataaa 


actagcafcga 


ctataacgga 


gtctgcaaca 


8580 


ttctcacaga 


ctttafcgata 


aaacatgaaa 


cttcaaagat 


acttaqqqtg 


qqqcagggac 

333 w ~33i3* , *' — 


8640 


aatgtttaag 


gctgcctgga 


agcctcccca 


tccctgagcc 


agaaagtcct 


atctcccctt 


8700 


caaggggaaa 


tgcttgaaaa 


agcactgatc 


aggctaaaat 


gacagggatc 


agggagtaat 


8760 


caaagtacaa 


gtgagctggt 


ctcctccatt 


ctgagcacag 


caaagttcag 


tctctccaag 


8820 


tccaagaatc 


atacacctgt 


ttgccaagaa 


tgaagttcag 


gtgtctacaa 


gtggctgaaa 


8880 


atattcattg 


ctgggccatt 


aacaacattc 


ttggcaaaac 


cataccttag 


cttctcgtgg 


8940 


aaatttctta 


aggtagaaga 


aacaggaaac 


acccaggctc 


gcttttatgt 


agacagttcc 


9000 


atgaagccag 


ggaccttccc 


cacatccacg 


tttcaattac 


ctgcacgcag 


ctcacagtgt 


9060 


attcaacatc 


tacgcgtctc 


tcctactggg 


gtggcggtgg 


ccactcaaac 


cctcatgcag 


9120 
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ctacgatgac cgcaattttg gcaacataat ttcatgtttt tccttgggct tttacccaag 9180 

tcagtgacac aattctgcag ttgtctaaag attcaaaatg agggacttga catttacaac 9240 

aataataaaa tcttgggttt cctttaacca agcacatgtt ctgcctttta gagaaagctc 93 0 0 

tgcaaactca agctggagtg ggatacttgc tgacatcttc aagcacccca ggaatagctc 93 60 

tactccccca tttccacctt ggctgaacca tctatatccc accaattccc ccaacatccc 9420 

tccatccgtc catccatcca cccaaggacc tgctaagcca ggaggtctct cccatctacc 94 80 

ccacagcctg gcctcagccc acaagggctc tctctacatg aatcccaccg caccagagta 9540 

gaccaagtct cccgtagact ccaccctgac cacctccatg cctccagcca ttcccacccc 9600 

taaaaaccct ccctggtctc tacacccagc tgatgaatac ttggctgaat gtgacctggc 96 60 

ctcctggacc caggtgaagc ccacgtcctc cgtaagcccg ccagctcacc ctgcctctgc 9720 

accttcactg gagagagccc gcacttcacc tcctcagggc aggcatggct gatgccaccc 9780 

agtggaatct ggtgcaaagc agggcccggt gcagagcagg gctgcctgca gagcaaggcc 984 0 

ctggtgctgg ggccgagcac ctccaatgct ggccgtggaa ccatccctcc cattccaggt 9900 

gctgtctcca tcaagaatga gcgagctgct gacatttgca tgacaataat gaataaatac 9960 

catattttgc ttcaaatcca gaatagatgt ggccagggtt ggcatatgac tgttgggaaa 10020 

ggacagtttg cctcttccca aaccaacttg gattataaaa agcttttctt aacgaccaca 10080 

agagcggagg agctcagggg cagacaaaag gaaggctggc tgcagaaggc gggagagtgg 10140 

ggccttcagg ggcgggtggg gagagagaaa gcctggagct gcacccccaa ggtctgtgta 102 00 

catcaggtgc tacagaataa caccacctct tccagcttgg cccccacctg ccctctccca 102 60 

gcccagtcac ccagacagca ccccactccc cacacacacc tcacatctgc ccgcctcaca 10320 

ctcaccagct tcggctctca atgcaacctg gaacctgccc ttggcctctc agctcagcca 103 80 

cccccattcc tgttggcccc tggcccccca tcgaattctc tctaatccta atgcacacac 1044 0 

ttgcacactc aaacacacac acacacacac acacacacag cccagaggaa aaccataatt 10500 

gactgaggtc caggcaagtt tcccgagcag ggaccacatt tcaaaggtca gggaagcagg 10560 

cgaacaggaa acatacaggg ggcacgtttg ggggtggagc aggaaataag aaatcacttg 1062 0 

caaaagataa aaagaaaatg aggtagctgg tttcagacac ctcggagcac acagaacagg 1068 0 

acaggcgcct ccgggtcttc cctcaacagg gagatgggcc aggcaggtcc ctgctgctcc 10740 



BNSDOCID: <WO 0177327A1 _!_> 




WO 01/77327 PCT/US00/16951 

67 



accgcagagc 




ggccctgaca 


ccaaggccct 


qqggcaggcg 

ZJ ZJ ZJ ZJ —J — ' 


gggaggcagc 


10800 


tgttctcctg 


cctgtgctcc 


cqqqcaqqqc 

13 ZJ Z) ZJ ZJ ZJ 


ctggccccac 


aagggaactg 


gecgaagget 


10860 


ctcrcttcfcrct 


actccggaaa 


qtcctqqqaq 

3 3 3 -J 


acaagcaaag 


gaettgetag 


gtcactccaa 


10920 


acggcccaga 


tgtgacaact 


qtqaaqaaqc 


cacaccaaag 


caaggtgaca 


gaacaatgtt 


10980 


cjcrtcraccrtca 

3 3 3 3 


ggt tat cage 


ttacgctcaa 


ctccacttac 


ccggactcac 


ccgtaacctg 


11040 


ccgfcctcfctc 


ccaaccagt a 


aaggatgect 


aqqt aqaqqq 

v-- 3 3 *-*3 3 3 3 


gcacaaggcc 


tggagcataa 


11100 


ttaccatttt 


aaaggctctg 


agaagtcctg 


cqqtqaqqaa 

-ZJ Z3 ZJ ZD ZJ 


gectagttea 


ctttctctcc 


11160 


cctaggattt 


cccaactgcg 


cctgatcaca 


gaacattttt 


tcatttccac 


tcaggaaaca 


11220 


tattttgaaa 


aacactggcc 


tagaggcaga 


agtgaaatgg 


aaaacacaaa 


agtaaaactg 


11280 


aacaqqaqqc 


actgggcaga 


qaacqqtcaq 


aqqcqccctq 

ZD ZJ ■ -j ■— w ZJ 


aatcctggac 


cggtggagat 


11340 


ccccagcttg 


gcatgctccc 


CtCCCtqqqC 


ccagaccgcc 


tccccccatt 


tcctggataa 


11400 


gaaggctaat 


ciccfcatcacrq 


qtqaaqqqct 

3 *-3'-*«-*333>- w 


tqcctqqqct 

3 ^ 3 3-3 


acacccccag 


gctcgcccca 


11460 


caccaatcgc 


qctcctqcqa 


gagccagtga 


ctttcttgat 


ttggctactg 


tggaattgtt 


11520 


tgcaactaac 


caccccagat 


acagatacaa 


atgacaggat 


gatcagatgt 


aaaggaccca 


11580 


caggtctctg 


teratacqqet 


tcatgcagcc 


agcatggcta 


gtgccgtgca 

3 ZP — ^ 3 


gaatgagaat 


11640 


aaccccaaac 
^ >-*. >^ w w ^ >-* 


aagtccttgc 


c tcccagacc 


cagaacccca 


tggagcccac 


caqqqctqqt 

ZD ZJ ZZJ ^* ZJ ZJ *~ 


11700 


tcacaagcac 


tQtctqcrcrtc 

w 3 3 3 3 w 


qqqc aqaqa t 

333 w *-*-3 *-*3 w * w 


tccagcaaga 


ggagggaac a 


tccatgcacc 


11760 


Qcracrccaatt 


accagaagca 


aatcgcctct 


tccaaaaccc 


aggctattaa 


tggagtccac 


11820 


tcfttqaatcrQ 

~J w ^33 — J ZJ 


aqctqqqqtc 


tagctatgga 


atactgeaca 


gcagagatct 


tcctgagaga 


11880 


aagcagtttt 


ccctgaaagc 


catgtgtcct 


ccactaactg 


tgttttaatt 


gggcgaacgt 


11940 


ctgtatctca 


ttqcaqtqqc 


cgcgcatgtg 


ctgacaaggg 


qctqqgggcg 

3 ZD ZJ ZJ ZJ ZJ ^ ZZ} 


gggtggggag 


12000 


cagaagctca 


aaaacctoqq 


aqqqaaqqaa 


acaggccacc 


aqqqctCCCC 


agaaggcatg 


12060 


tatctctctc 


acaaacacac 


gcatgcacac 


acacgtgcac 


acatactctg 


caagccctga 


12120 


gttagcaact 


gtggaatgtg 


accagctcag 


tgatcccagg 


acaagctget 


agggaatatg 


12180 


acatttgatt 


gatgtctgea 


aatgtgcgtt 


ttcactaatt 


agaaggttta 


gggcagagca 


12240 


gagaaaaata 


tgtatttcag 


agtcccagtt 


tgacctgcca 


gaaaccagcc 


cattactaac 


12300 


attcttattt 


tcaacaaaat 


atagcattct 


gattacatac 


catcttggtt 


ccacgcctcc 


12360 
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tgccttgcca 


agcccccgga 


agcggcccaa ggccatggca 


aatagtgaga 


gaaacagttc 


12420 


cagggtggag 


actgactcag 


gggtgtcagt 


cagtggggcg 


ctgatggccg 


gtgggaggcc 


12480 


agcagtcatc 


accctctcct 


tgggacagtt 


gagtagctct 


cccccagggt 


catgtggcca 


12540 


ctcaggttca 


tatgggaggc 


gagaggagtg 


gcagagtcca 


ggagagtggc 


tccgaagtca 


1260O 


ctgttccctc 


caggcctcag 


tgtcttcatc 


cattaaatgg gtaggctgag gtctgggatg 


12660 


acaaggaggg 


cttgcactta 


ctgaaaccca 


tgggaggctg 


ttcgccgatt 


tcttttattg 


12720 


atggaagaaa 


acactcgtat 


aattcaagta 


ccaattaaaa 


ggcaggcact 


ggaaccaccg 


12780 


tctgccaatt 


cctagttttg 


cctataccaa 


atttgagcaa 


gttaattgac 


ctctcccagc 


12840 


ctcagtttct 


tcgtctgtaa 


aatgagggta 


gggatggccc 


ccagcccaca 


gggcagctgg 


12900 


aaggattaaa 


gaaatcaaac 


atctcttaga 


gcccacctgg 


cacactgtga 


tacacaacaa 


12960 


atgttagcta 


tttttgtcta 


tgaagtctag 


attttatatc 


ttgggtgttc 


taaagcagga 


13020 


tacatttatt 


taaaaacaag gattttcatt 


aaacacgtac 


cccacagaca 


gcaaccccat 


13080 


ggagactgct 


cttaattcag 


gccagtatcg 


aaacgactct 


aactacaagc 


tttatacagg 


13140 


tctcttggct 


gtccttcaaa 


tccaactaag 


gtggtacttc 


tgaagcactg 


tgcacatgtg 


13200 


tgtgtgcatg 


cacacgtgtg 


ggaagggcgg 


gctcacggat 


ccctcaggta 


ccccacccac 


13260 


gcagtctcaa 


gtcacaaagc 


gacagagcag 


ccgaggaagg tctgtgcccc 


actggaccct 


13320 


cgtgaagcca 


ccaactctac 


ctctgcgccg 


tgtcctgcag 


actgggctac 


cctttgggtg 


13380 


gggaccagca 


tttgatgcaa 


gaaaggcaga 


cagaaaagga 


aaagggcaag 


ttcgactcca 


13440 


gataacacag 


acagtaccaa 


gccccagggt 


ccataaatgc 


cacgcagatg 


gaagcatt ta 


13500 


ctgcgaggcc 


acacagcaaa 


cgcacggatc 


cagggacgga 


ggtgcagact 


gcggtgcccc 


13560 


tgagccatga 


ccctgcaaat 


taccaccatg 


ggaaaggagg 


ctgccaaacc 


ccccgacagt 


13620 


cggctgggct 


ggcacagact 


cgtggtttcc 


atcgaggtgg 


gaggaggtgg 


gacgtcccag 


13680 


cccctccccc 


atgcccactg 


cagagggaag 


cggccgtttc 


ccctgtgtgg 


ttacaaaggt 


13740 


ctcattgttc 


ttcctcacag 


ggaggaaact 


ggaggaccga 


gctcagaacg 


cattttagaa 


13800 


ctggcagaaa 


agaacatctg 


gggaaggaaa 


cacatttcag 


aaacaaacat 


acctttgtac 


13860 


cagcttttat 


tttctttaag 


tgttgaaaaa 


ataataataa 


taaagacatg 


ccaaatttat 


13920 


catcgctcta 


caaaatccct 


ttattgagca 


aaacgtggca 


gctctacttt 


caaatgatta 


13980 
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ctgttcctgg aaaattgcag caacgtggat gccaaggccc gaaggccgcc atcagcagcc 
aaacaaaaga tgccacctcg ggctccgcga cactgtacca tgccagggaa ctggacagat 
ttggggaatg ccacggtttg cctttaaccc cttgcctcct ggtctcctga tgcatctcag 
aggctaacat tctttgagga actggcattt cttagttgta aatatgcatg tgggtttggg 
agctgcctgc aaagtccagt gttgacgatc agctttgatt tccttggaat caagtttacg 
tgtcgagtct ggaagttaag aagaatttgg agaagctgag cactatggtg ttgcaggccc 
tgggtgaact cttccaccaa gcattcattg tggactgaca gcgtgcgagg ggctctgcag 
gcaggtgcac aggacgaaac acattccgtc cgggggaaac ctgcaggaaa gctccctctt 
cttcctaagg tgccgggcct agcttcatgg gtccctaccc tccacgcctg tcacactttc 
tgagtctcat gtgggagctg cttctggttc ctgacttcac tcagtcctca taggaggtgg 
aactactgtc accccatttt acagatgggg agactgggca caaggggacc aagaaaccaa 
tgcaaagtca cacttgtggg atcagtgaca ggggagatca attcccaggt tctttctgca 
agagttaaat tgttttcatg ctgcctaagg gggggcaact gaaagaccac tgcatatctt 
tgccaaaagg gtcaagcaca ggagccgcag ccagtgggtc agatccgcag aggcgctggg 
gtgaccctcc ccatacctgg agggatgctt gtcccctcct ggccttcact gggtcccctc 
atgaccgtgg cctcccagga cctcagcaca atcccggtcc tgtgctccag gacaagccct 
ccgtccccaa gactgtgagg aaatggaacg aagaggggct cgctgcagcc cagcacccac 
actgcccctt ctcaggggca agaaccgtcc tggaggactt ggctttggag ggggagcctg 
ggaggccagt aagtcaacaa gcctctactg ctcatgggtg ggatcccacc gcaggccccc 
acctgctggg gcgggcaggg acgggcggca cagcttggcc agggcagata acccccacct 
tggccagggc gaaggcagga cacgtgggct ccagcctggc cccaccatcc ctgcacaaca 
ctgggcaaag tccacgtttt cctcaactgg gtgttgacat ctgcaggaca ggggcatgga 
ggtacagagc gctgaagcca cacagcaacc taggagcgag actccatgcc tccccgggga 
cccctcccca ccatgaggac catgaaggct tcccatgtgc cgcaaggact ctggtgtgga 
gacacacgtc tcctacacag ccaggcctaa cgctcttgta actgggtggt cccacctggg 
ctcacagctg gagggccagg agctcaaggc ttcgcagggt ctgctctcat cccagaggcg 
atggggagcc acagcaggct gcaggagaga gggtgggccc cctccacttc agaggcccca 
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tctggcccac 


agactggaga 


gcacatctct cagcaaccac 


ggagcgccaa 


ctgcgcacag 


15660 


ggcctggtcg 


tcagagcggg 


gcaaaggcac 


tgaccgtcac 


ggccagggcg 


agggaagacg 


15720 


ggtgggcagg gaccttgggc agagggggaa gaacctggtg cccaggctgg ccctgccttc 


15780 


agcagtgaag 


ctgagtgggg 


aggcgctgat 


gcagggggcc 


agaaagggct 


gctggtcagc 


15840 


cgggaggagc 


cccccacaga 


ggaagcagcc 


agcccagacg 


cagatggcag 


ggtcccctca 


15900 


acaatgtcct 


ctgaaaagga 


gaggcgggga 


ctgctctggt 


gacacctaca 


aatagatagt 


15960 


cagccctcag 


ccccctgcca 


tacttctgac 


aaagcagagg 


cccccagggg 


aggcgcaccc 


16020 


gaaggtacct 


gcacctgtcc 


cccagactcc 


tagagcccac 


ctgaccccat 


cccaccaggg 


16080 


ctccagctac 


aaaataaatg 


ccgaggccag 


ctaggcaagg 


acgcacactc 


ggtaccgact 


16140 


gaataggctc 


cacgttgtca 


tgagcgcaac 


ccacaggcca 


ccaggccaca 


ctatgcagag 


16200 


ctgagatggt 


ttcggccaag 


cagcctctca 


gctgagctga 


acaagtccag 


agtccccggg 


16260 


gggtcgtcac 


tatggagtaa 


caattgcgat 


gcgatggtaa 


ccctaacagc 


taaccgtcac 


16320 


tgagccaggc 


cctgagctag gtacttttca acgctgcctc 


tctgcagcct 


caggacgagc 


16380 


ctgtgggagc 


ataaagatca 


ttccctatca 


cggatgggga 


aactgagctc 


tgaagcagtt 


16440 


aacgtgcttg 


tcccagaccg 


cagagctagg 


agcaggacac 


aacagcaggt 


caggcaggaa 


16500 


cgggtgaggg gggcctgcat 


gggcttctct 


ggaggctgcg 


catacacgca 


acccccagga 


16560 


ccccgaccct 


gcacctgcag 


ctcgctactg 


ccccctcagt 


gactccagca 


aacctcgggg 


16620 


taggggaagg 


aggctgggaa 


tacctcgggt 


gtccgaaaca 


gcagcttctg 


cttggaggcc 


16680 


actgctgcat 


aatggttgct 


gcccagcaca 


ccccaagcca 


cctgtgccac 


ctgtggtgac 


16740 


cttccagcat 


gccttggtga 


ccaagctggc 


cttaggtgct 


gtgggcagcc 


aagaatagaa 


16800 


cagggcccac 


ccctcctctt 


cacactaaca 


caaagcaaga 


qq c crcrcrc a C t 


tcgacfcgagt 


16860 


gcatccctct 


agctcaaggg cctcacggat 


cacaggggtc 


agggcaagat 


cccaattctg 


1692 0 


cattcccgtc 


tgcctttcat 


cctgctctgc 


caacaacagc 


cagtgaggct 


ggggacatcc 


16980 


ctgaacctgt 


ttctcacctg 


aaacacatca 


taccattgga 


ccccagccct 


ccgggagagg 


17040 


ccctaatccc 


tgactgtggt 


gagatcagat 


cactggttaa 


gtacccagaa 


gggccttggt 


17100 


caggggctcc 


aggggtgggg ggtgatgggc 


gtggtggtat 


cccgctctgg 


gctatagtcc 


17160 


accctgatgg aggaggtctg tggtcagaac 


cgggctgtgc 


agggcacagg 


agcccagagg 


17220 
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gacccccaga 


gctcacctgg 


tggtctctga 


gcagggctcc 


ctcaaccctc 


agagaaaagc 


17280 


acaqcaacrcra 


qqcCQCCCSLG 

z3_3 -3 — ' 


agcccagcgc 


ctagcaccca 


gtggcgtgcc 


agacctgcct 


17340 


aaatcctcrcra 


gate tc teat 


caccctccaa 


gtcagtcatg 


cccaacccag 


ggacccacag 


17400 


c c c a ccrcrcrcrc 

^* Vp< t* w 3 3 3 3 ^ 


c cr t era a era t q 

^ -3 3«-*-«-«-3 3 w -3 


tgetgagtec 


aagaaggect 


tcgacactgg 


gaagccaagt 


17460 


ggcacctcct 


ggtgtggagc 


aggeggaate 


ccaccagcct 


ctgctctgcc 


agtgggcaca 


17520 


qctqqacqat 


qagcaqaagg 


ggctgttgct 

3 3 3 3 


taataaacgt 


catttcctta 


agaggataaa 


17580 


acctttcaaa 


acagatggaa 


attttttttt 


aattaaaact 


ggtggccaaa 


gagatggaaa 


17640 


gcaccccttg 


tgcctccctc 


ccatcgtgac 


ccatcctctg 


cacacctcaa 


gctgttcgct 


17700 


Qcccaqqtcrt 


ctcctgaggc 


ac t qq QQcr cq 


ggtgagaatc 


cgtgagccct 


cggccagccg 

3 3 3 3 


17760 


tggctctctg 


gagctctgcc 


ccaggccatc 


agggcacacg 

3 3 3 — t 


ccgggcaccc 


tgggggecac 


17820 


acacrcrcfcacra 


qcccaqctqq 

3 3 -3-3 


gtcagcacac 


agggccacac 


tgggcacaca 


agtctctgag 


17880 


cctcccctgt 


ggaegcaget 


ctcactatcc 


caccccacta 


ggtcccgggg 


atctgtccca 


17940 


eaocratcfafca 


tgctgtcaca 


gaccactacc 


aqaqccatqq 


cctgctgttc 


cgcccgcagc 


18000 


c acrcr fc acr tea 


cttgctccac 


acrcrcra c acr crc 


aacgccgcac 


ttqqqqqctQ 


ctctgcggca 


18060 


era a c t aqacj c 


tccagcagct 


cagccctcct 


qaqaaqqaqa 


actccatgct 


ctaagaggca 


18120 


qacqcaqcqq 


acggcaccaa 


agccaccaca 


agcccacggg 

3 3 3 3 


gccctgcatg 


gcaggtcagg 


18180 


agtccctgac 


cactcgctct 


ttgtaaccag 


agetgeagtg 

"3 3 3 3 


gagtctacga 


ggcaaggact 


18240 




crtQaccacaq 


caaatgaatg 


agtgtcccaa 


qqqaqcaqqc 

333 3 ^33 


ggctgcgggg 

3 3 3 3 3 3 3 


18300 


aaacacaacc 


crcrcra c c c acrcr 

333 33 


agtcctccgg 


cactgcagca 


aactccctgg 


gccccctgag 


18360 


caqcqaccaq 


qtqqcaaqtq 


catgaactcc 


cqqqqqcata 


acctgggagg 

^ ^ w 3 3 3 33 


gtgacactct 


18420 


cttcgtgttc 


aaattcttga 


gaaegcatta 


aaaatatcac 


tcagtcacct 


actctatagt 


18480 


tttaactcaa 


aagtaccaaa 


gtagecagge 

3 3 33 


gcggtggctc 


aegectataa 


tcccagtact 


18540 


1 1 ocrcraacrc t 


craqacaaqaq 


gatcacttaa 


geccaggagt 


tccaaatgaa 


cctgggcaac 


18600 


atggagggac 


cccatttcta 


caaaaaaagt 


gttttaaaaa 


attacctggg 


cctggtggtg 


18660 


tgtgcctgta 


gtcccagcta 


ctcaggaggc 


tgaggcggga 


gaaccacatg 


aacccagggg 


18720 


aggtagaggc 


tgeagtagge 


tgtgatggca 


ccactgcact 


ccagcctggg 


taacagagtc 


18780 


agactctatc 


tcaaaataaa 


tttaaaaagc 


accaagccag 


gcttggtggc 


tcacacctgt 


18840 
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aatcccagca 


ctcagggagg 


ctgaggcaag 


tggatcacct gagtcagaag ttcgagacca 


18900 


gcccagccaa 


catggtgaaa 


ctccatctcc 


actaaaaata caaaaattac ccaggcgfcgg 


18960 


tggcgggtgc 


ctgtaatccc 


agctactcag 


gaagctgagg caggagaact gcttgaaccc 


19020 


aggaggcaga 


ggttgcagtg agccaagact 


gtgctactgc acfccaagcct gggagacaga 


19080 


acgagactcc 


atctcaaaaa 


ataaataaat 


caatcaaaac caccaagact ttttaatata 


19140 


aacatttatt 


attccataat 


tccttttttg 


catgattaaa aatgtttata taaagtttcc 


19200 


tgaaaatggt 


aagaatgcca 


agtgaaggct 


gcaaatgccc aagcccccac cgtggcatct 


19260 


cacggagtct gggccctagg aggctggtgg gtaccacgtg gacccgagac ttcacagtca 


19320 


agtccctttg gggtacactg ggtttcccac 


accccagaaa tatgggctct tactgcagga 


-L O yj 


ccatgggggt 


cctcacactt 


ggcccagaag 


ctgtcacata gccagacagg tgttctacaa 


19440 


cctaggctag 


agggagctca 


tgctccagca 


gaattcgagc cagaggaggt aaaagatggg 


19500 


taagatctgc 


tccctggaca 


gatgaggcct 


tggcctcaga acagttactg atcatctacc 


19560 


agacatcaca 


ctagaggcag 


aggggcgcag 


acgaagacag cccctgtcct caaggccctc 


1962 0 


ccaggttggg tggaccatgg aaggttccag acagatctgg caagagaagt gcccacacca 


19680 


ggggcagaag atgggcaggt 


ctgctcaggg 


cggcacggcc tgccaggcca aaaagttcca 


1974 n 


acttcagatg ctggagaatg ggcacgactg tctgagaaag ggaaggatgt gatgaaaact 


19800 


acttggagaa 


aaattaatct 


ggccagagca 


taagataaat gggcaaaggg gaggttccag 


19860 


aaagcaagga 


gaccaagtaa 


aagctgatgt 


cattggctct gaatctaggc tttcactgaa 


19920 


tatgcaccgc 


agggcctgta 


ggtaaagcct 


cagagcccag ggagtctgag tggaggagag 


19980 


ggcaggggac 


agagctgggg 


cctgtgtcta 


cagtgctcag gaggaatagg catggacgtc 


2004 0 


agctcggagg 


ctccagctga 


agtgaggagg 


cggccagggc agcacggcca cgcccggatc 


20100 


cagactcctt 


ttgggaagca 


agttcgctct 


gggggaaagt ttggagaaat ggcctttacc 


20160 


cgcagaagca 


agccccagaa 


catatcttgc 


tccaaaacta tctcgtacag tgaggacgtt 


20220 

£t \J Ct \J 


aagcttcagg 


tcccctagag 


gagacagtct 


gctccttcct ggggcagaac ccaaggtggc 


20280 


cagagcctgg 


aaggcaccca 


gcacccaggc 


tggtgtgttc cagcccaggc cacacgctca 


20340 


gatagctatt 


aatgccccgt 


tgagcaattt 


cctgagagct ttgccaggca ggtaccgcct 


20400 


ccccatctga 


actaatacag 


gggtacatcc 


caaggaagaa atgaaaggtg cccacatttt 


20460 
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gctctgggat 
gctaatgcta 
gggcagcagg 
tgtcagcaaa 
tccacagcct 
gctggatatc 
cggctctgaa 
aaggacaatg 
atggagacag 
gtctggaatc 
gaatggatag 
cccctacctc 
ctgaatgtaa 
aatctggaga 
attatatgaa 
ctagctacag 
aaataaagac 
gtctgtaatc 
gagaccagcc 
ggcgtggtgg 
ggaacccggg 
cctggaggac 
gaaaagaaaa 
taaataccaa 
caaactggac 
atataggaaa 
gagttatgta 



taactaggga 
aaattagtgt 
agtggctcgg 
acccacaaat 
gagaccgcca 
accatttacc 
gttcaccaca 
aggtgggaat 
cagcattccg 
aggacccagg 
gatggaccat 
ataccatata 
aacataaaac 
tggagtaaca 
gcttcttaat 
accagcagtg 
ctcccacaaa 
ccagcacttt 
tggccaacat 
cgcatgcctg 
aggcagaggt 
acagcgagac 
agacaacaga 
taaaccagaa 
caaagaaaac 
gtgtgtgttc 
tgtatgtatg 



ggggagtgat 

gcattagaat 
gcaagtcggc 
gtcaaggccg 
gctcatcgga 
gtccccagat 
aaagcataaa 
ttacataact 
gtacacaaag 
tgtctgtgga 
cccaccaagg 
caaaaaaaaa 
agtaacagtc 
agggafcagga 
atctttatga 
aaacctgaaa 
tctataagaa 
gggaggcgga 
ggtgaaaccc 
tagtcccagc 
tgcagtgaac 
tctgtctcaa 
aaaatgggcc 
atgagggttg 
accaaaaacc 
tcgtagatgc 
cttgtatgta 



73 

aattaactca 
ttctttcctg 
acaaagggca 
gctccactgc 
ggccatttaa 
caagagatca 
cgtgcaagca 
gatcttaaaa 
acactcacag 
cactacacat 
tgttggacaa 
aaaaaaaaaa 
ctggaagaaa 
aaaaagccat 
taatgtacca 
caaacagaac 
aaagataaac 
gatgggtagg 
tgtctctacc 
tacttgggag 
caagatggca 
aaaaaaaaaa 
aaggataagt 
tgcaaatcaa 
aaaatcttgt 
ttgcaggtat 
tgtatttgag 



gtaattatat 
agcagacacc 
cctccagagc 
acccagcaga 
aatccagccc 
aagggtggaa 
gagccaatac 
tatgttctga 
gcagtggagc 
aaaagagcag 
ctccctattc 
aaacccagac 
ataatggagg 
agggaaaaag 
ccagaaacaa 
aaagaattaa 
aggctggcac 
tcacttgagg 
aaaaatacaa 
gctgagccag 
atcgcgccac 
aaaagaagaa 
gtaggcaatt 
aaggtgttat 
aattgccagc 
gaactgctac 
acagggtctc 



ttgccatcgg 
ggagtgagtt 
cttccacaaa 
tgaattcact 
tctgacacct 
cctgatagga 
gtcttttgaa 
tgcttcagag 
acagtgaagg 
catttacaat 
actggccaga 
agaataatgt 
atatatttat 
tagagttatg 
ggatgaagga 
agtccatacc 
cgtggcttat 
tcaggagttc 
aaattagcca 
gagaacagct 
tgcactccag 
gaagaaaaaa 
tgcagaaaag 
aatttttaac 
atcagagagg 
agccttttag 
gctctgttgc 



20520 
20580 
20640 
20700 
20760 
20820 
20880 
20940 
21000 
21060 
21120 
21180 
2.124 0 
21300 
21360 
21420 
214 80 
21540 
21600 
21660 
21720 
21780 
21840 
21900 
21960 
22020 
22080 
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ccaggctaga 


tctgttgcag 


tgctgtgatc 


atggcttact 


geagecttga 


cctcctgagc 


22140 


tcaatagatt 


ttcccacctc 


agectttcaa 


gtagctgaga 

ZJ ZJ ZJ 5 


ctacaggagt 


gtgeaatcat 


22200 


act cage taa 


ttttttaaat 


tttttgtaga 


cafcqqqqqqt 


ctcccaattt 


tgcccaggct 


22260 


ggtctcgaac 


tcctggactc 


aagtgatcct 


cctgcctcaa 


cctcccaaag 


tgctgggatt 


22320 


acctggatga 


gccactgtgc 


ccggcctcaa 


tatctttaaa 


aacagaaatg 


gacacactct 


22380 


ttgacfcagga 


atgtatccta 


taaaaacact 


tatacacatg 


cagagacaca 


cgagcaagca 


22440 


tgctttgtaa 


tagcaatgaa 


ggctggaaaa 


actcctcaat 


caggtaaatg 


ctgtcaagtg 


22500 


cacctafcata 


cfcafcgaaafcg 


acacttcract 


tttaacaaga 


gcaaagacag 


aaaagcaaaa 


22560 


gtacaaagta 


CfQQ t cr t q a t Q 


gcacatgcct 


gcagtcccag 


ctactcagga 


ggctgaggca 


22620 


ggaagatcct 


ttgagcccag 


gagttggagg 


ccaggagctg 


ggcaatagtg 


agaaaaaata 


22680 


aaattaaata 


ataataataa 


taaaataggc 


tgggcacagc 


ggctcatgcc 


tgtaatccca 


22740 


acactttggg 


aacrc tq aqqt 

3 3 3 Z3 5 


cjgg acrcja. t c g 


cttgatccca 


ggagttcaag 


gccagcctgg 


22800 


acaacaaaac 


aagacaccca 


tctcaacgac 


aaattttaaa 


aaatcageca 


ggcaggctgg 


22860 


crcatQcrtQQC 

3 ua 3 3 ^33 


fccacgcctgfc 


aatcccagca 


ctttgggagg 


ccgaggcagg 


cagatcactt 


22920 


qaqqtcaqqa 


gttcgagacc 


agcctggcca 


acgtggcaaa 


accctgtctc 


tactaaaaat 


22980 


acaaaaatta 


qctqqqcatq 


gtggcagatg 


cctgtagtcc 


cagctactga 


ggcacaagaa 


23040 


tegcttgaac 


caqqqtqqca 


gaagttacag 


tgagecgaga 


tcgtgccacc 


gcactccatc 


23100 


ctcicracataa 


crtcraaactcc 


fcgtctcaaaa 


aaaaaaaaaa 


aaaaaaaaca 


aggagecagg 


23160 


c a c act t aa aci 


t cr acrcrcf acrcrcf 


cacagaagca 


gcgcctcttc 


tgggggcacc 


cccaatctct 


23220 


aacaatccaa 


aggect cagg 


atcctgaagg 


gagaaaaaac 


gtgaagctcc 


gtgctagaag 


23280 


agaccataga 


gattggaatc 


agctggttct 


attttacaaa 


aaaaggaaac 


tgaggccctc 


23340 


agaaggtgag 


tgcctctcaa 


tgccccacag 


ggaggcaggg 

33 33 33^3 


agagggctct 


gagccctgca 


23400 


crcTQC c c t Qcra 


fctctfcgcaat 


qqqqtqqaqt 

3-333 33 J 


ggagcctgtg 


ccgcccccac 


caggcacctt 


23460 


ctcaggagag 


gagccgttgt 


catatccttg 


aaggggtcct 


tgagcccctc 


aaaaggctaa 


23520 


aaaccacttt 


cctccttgag 


tgaaccttca 


cctcagttta 


accacaagaa 


aaactacatt 


23580 


aaggcccagc 


gcagtggctc 


atgtctgtaa 


tcccagcact 


ttgggaggct 


gaggtgggtg 


23640 


gategcttga 


geccaggagt 


tcaagaccag 


cctgggcaac 


atagtgaaac 


cctgtctcta 


23700 
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caaaaaacaa caaaatcagc tgggcgtggt ggtgcacacc tgaggtccca actacttgcg 
ggctgaggtg agaggattgc ttcagcccag gaggtagagg ctgcagtaag cggtgactga 
atcactgcac tccagcctca gcaacagagc aagactcaaa aaaaaaaaaa aaagcaggcc 
9g9tgtggtg gctcacgcct gtaatcccag caccttggga ggccgagcgg gaggatcagg 
agatggagac catcctggct aacacggtga aaccccgtct ctactaaaaa tgcaaaaaat 
tagccgggcg tggtggcggg tgcctgtagt tccagctact caggaggctg aggcaggaga 
aaggcgtgac cctgggaggt ggagcttgca gtgagctgag atcacaccgc tgcactccag 
cctgggcgac agagcaagac tccatctcaa aaaaaaaaaa attaaatctc aaaaaaaatt 
acattaaggc aaactaaaag atgtttaaaa tatatatatt aaattaaata cactccaata 
gagcaaatac gaaaataccc agaaaacaca atccccgcac ccccaggaca acctcccagg 
gggtccacag caagagaccc caagcacgag agacagagaa cagtgtccct gtggcggaac 
ctctggccca tcaggctcta ttagaaaata aggctcttgc cactgagaga aagaggcaca 
gtcgcccagc agccacgggc tctggcacac cacgagtcag gccagcaaag tgtcaactgc 
cccctacaag gtgacaaact aggacaaact ggaaaccaga ggctggacct ggagcacagg 
gaccaccaca tggggctggg gaatgggcag ggacctcaga gcgccaccca catgcctaag 
agcagcgcgt atgcgcatgc ctctgcatgg cttagggaca cagggagctc cccccacccc 
caacccagga aggcagcccc cactacccag gtagggaacg gataggacca gcaccccgtt 
ctgctcgtaa ctcagggctc caggccccct cgggggcaac cagcacagag ctcagacccc 
aaatatcttc acccacctcc tggtccccat ctggacaagg gtgctgggga ctggctctca 
gtcacaccct cggggtactc ttcaaaggac agctggatgc cccagggcag gagcttttgg 
cccccagctc cctcacccca gacaccagct cttgggaccc caccagcatg ggcaaggtgg 
acaccatcgt cccgattttg cagatgagga aactgaggct gagggctggc acacggctct 
ccagagctga agagaatgca gagagcagcc ggagccagcc ggtgggtccc tgaggccggc 
tcgtagcaag ccacagctgc ctccgcccat cacacttgga cctcactggc cccaggacag 
ccctccaggg cggcctggca cagagcccac accctgctgc ttcctgaaca aataagtgaa 
caaggccacc aagccgagga cctggatgta gccccggctc ccgccagggc ctccccaaca 
gactccccat ttggagagcg cattaagtgt ttccaaagcc tcacaaacca cagatgtccg 
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gctgtctcac 


ggcttctgta 


acctgaactt 


ggccctcact 


ctgccctccc 


agcactcctc 


25380 


tcagggccca 


ggcccctcct 


ctgagatgcc 


agcactgact 


ccccaacttg 


tccccatcac 


25440 


ctggctcgtt 


cctgaacctc 


ggcaggagag 


tctcaggcca 


gatcctccca 


cca.gcca.cct 


25500 


ccaccaggat 


gcaggaggca 


tgagacctgc 


tcgtgccggc 


tgggagatgc 


aaccaaccaa 


25560 


gatcaatcca 


atcagcggat 


gaactgacaa 


atataatgtg 


gtccctccac 


acaatggaat 


25620 


attattcagc 


cacaaaaagg 


gctgaaatag 


qccqqqcqtg 


atggctcaca 


cctgtaatcc 


25680 


cagcactttg 

— ' ZJ 


ggaggccgag 

ZJ ZJ ZJ ZJ ZJ ZJ 


qccqqcaqct 


cacttgaggt 


c aggag t tea 


agaccagcct 


25740 


ggccaacatg 


gtgaaatccc 


gtctctacta 


aaaatacaaa 


aattagctgg 


qcqtqqtqqc 

ZJ ZJ ZJ Zj ZJ ZJ 


25800 


gggcacctgt 


aatgcaagct 


acttgggagc 


ctgaggcagg 


agaatcactt 


aaacccagga 


25860 


ggcagaagtt 


gcagtgagcc 


aagatcgcac 


caccgcactc 


caacctgggc 


aacagagcaa 


25920 


gactccattt 


caaaaaaaaa 


ataaaaggct 


gaaacaccca 


tacgtggtac 


tacttggatg 


25980 


actcctgaaa 


acgttacagt 


aaccaaggaa 


gtcagccacg 


aagaegcatt 


gtaagattcc 


26040 


cttcatgcaa 


aatgcccaga 


a c acrcrcacraa 


ccacagaggc 


aoraaacrtcora 


ctggtgt tea 


26100 


ccaggggatc 


cggggagagg 


gaacgggaag 


tcaccgtgta 


atqqqtatqq 

-3-3-3 wwfc w 3 J3 


gttttatttt 


26160 


ggggtgatgg 


aaatctctta 


taacttgata 


gaagagaggg 


ttgtaaacac 


tgtgaatgta 


26220 


ccaaatgcct 


gccttctata 


ctttaatatt 


ttatattata 


taagtttcac 


ctcaatttaa 


26280 


aaaaaaaaca 


actcgacacc 


tttcacctag 


gaaagatctg 


gctttagctt 


gcatttcctg 


26340 


taactcctgc 


ctaaagcctt 


ccagaagctt 


ccgctgcctt 


atacratcaca 


accagactcc 


26400 


acaccatgat 


ctggcctcta 


agggcctctc 


gcaggacacc 


c c cr a crcrcr t ct a 


aggagcaccc 


26460 


gtgggcccac 


ctctgcatag 


ctgcaaagct 


tctttccctg 


tcctcccctc 


t ac a t gggaa 


26520 


gctctgcccg 


caggggcggg 


gccttatctg 


ccattctatc 


gcacfccaacc 


ctagcacttc 


26580 


actcggtagc 


agacaccaaa 


gcaaaacagc 


aacagcatta 


taccgggcca 


qqtqcacqtt 

ZJ ZJ ZJ w»w ^3 w w 


26640 


aactcactga 


attcatggta 


ggaaggattc 


tattcccatt 


ttacaggtga 


gaaaac tgag 


26700 


gcacacaaag 


gtagcatcag 


cttcctaagc 


ctcccagcac 


aggaagegge 


caggctggaa 


26760 


tcagaccctg 


ggcgcagggg 


ctctgtccac 


agtgctaact 


aactactcct 


gcccccgagg 


26820 


gctgcagcgg 


tgagtgagtg 


agtttgtcag 


tggactggat 


gtccaaggtc 


atacaggaaa 


26880 


aatccagact 


attgtaataa 


cagcctctag 


accggctggg 


gecagaaaga 


tegaggaege 


26940 
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tgacacacaa ctgcgctcac tgcagctctg ccagggatgg ggctaaaggt ctcacacagg 27000 

gcagttaggg ctccccatag cctgggagag gaacggggtg agataacaga aactaggtat 27060 

ggtgcccgaa gtcaaacagc cactgagcat gtaaacccag gtgggtctga ccccaaaccc 2712 0 

ctccaccccc atcagccctg caacccgtcg ctgcaaggga gaaagcaact cagaggcctc 27180 

acctgcctac atcccccacc cgtgtgtgtg agttctacta aatgcctgag cagtgacaca 2724 0 

gcacggctga aattaaacgg gttccaaaaa cgacaggaag cacgaagtga atctccccag 273 0 0 

gaaagtgctg aacaaatgct ggatcgggtt caccggcgaa tttcttggaa ctgaagaggg 27360 

gagctaaaca cacggggccc tgctttggag gggactctct cagggtgctc cacacagcac 27420 

ttggttaacc ccactcagcc cttctgggct ctcccagagg gcccggcctt ggccttgggc 27480 

atctacagga ggaacctcca gggggagagg gggtgcctgg acaggccggc cctggaacaa 2 754 0 

gcacttgggc cccgaggaga gaggactagg gcttgggagc tggggaagtt ctcagcactg 27600 

ggaccactag aacaaagcca tttccgtgcg ttcacagctt ccaattgcaa caggaagcaa 27660 

tcaggaaaaa taattagcgg cccacttact ggcttcgctg aggtccgagg catgtatttc 2772 0 

acacagtaaa accagggata taacatcaaa accgttctgc agaaagattc ctccctttcc 27780 

ttccatttta ggcctggatc accacattca ctggggctcc caggccttgc tgcctaatgt 27840 

taaaataatc aactctattt ttgcctcaca cacaactgaa ctctacagct ataattcttt 27900 

ctcctcaggg gctcgaacca catggacgac aggcatttga ctccagcaac atcaccccaa 27960 

aacgtgcaca aaacccaaaa ctgcaatgag gtgaaaggca acgcggtcgg cctagaaacc 28020 

ccccctttaa aacaaacagt ttccccaaaa ccccttttgc ctccttgacc caggcatttc 28080 

cggaaaaagg agcggcgctg gcctgtactc cccagatact gtcgctgttt tgtcttcacc 28140 

ttgttttgct agctccagac aaggccccac aatgtaaaca cgctcctgaa agaggcagat 28200 

ttggggtgaa actgtccata gaatctctag gcttgggtca gaggcaggag gacgtgaaac 28260 

aaactccaag ctcctcctgt tccccgctgt cccccacacc tccaagcaga ggctgcagcc 28320 

tgggggatct gactacaggg ccaccccgct gcaccattca cactggaaat attcagggag 28380 

acagctgttt gccttaagga ggcccagaca aaggggcccg aggtcctccc cgctaaactg 2 844 0 

ccacaaacag aacaggagcc gcggcgtgca caggcacttg cggccgtgcc acttggccag 28500 

ccatactcca gaaaaacaaa acacgcacat ccgaagagaa tgatttaggt agcaagaggc 28560 
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ttgcttgaaa aaccacatgg caatctccaa attaaaagaa catgtgtagc gtttcacgac 
tgcttaagtt tcctgagtcc tcctgacctc aactccaccc cctgggaaac accaaaagtt 
ggatgagaaa gttcccccgc cctacctctc cccacgggag tgtacaactg aggcacaagc 
ctgcctcccc cactgccccg cgatctggga ccacgtctcc tccgcgtagc cgacccgggg 
atggacacta tctggggacc cggcggccac acggggcatt cgggtcgccc gggcacctgg 
caggtgtcag tccgcttgga aacccacagc cacgcggctc acaggagcag cgccaccggc 
taggccgccc cgcgcccggg ctcagaactt tctcgctgcc acttcagccc gtcctcggag 
cacgcggggc ggccgcgcgg ccgctggaaa caggcttgcg aaccggctcc ccgggccagg 
cccgcctccg cgccccaagt ccccgctcgg tgcccggccc gggccacacg ggcccagcgc 
gggctcggct cggctcccgg cttcccgcgg gctcgggcag gtgaggaccc gcccgcgccg 
cacctggcgg agcgggcgcc ctcctcgcca gcccgggacg cagcgtcccc ggggagggcc 
cgggtgggga gacaaagggc ccgcgcgtgg cggggacgcc ggggacggca gggggatccc 
99£fcgcgcgc cccaactcgc tcccaactcg ccaagtcgct tccgagacgg cggcggcgcc 
cgcgcacttg gccgcggggc cgcccgggcc attgtccgag caacccgcgg cccgtcttac 
acgccgggcg cgggaaggta tcgaatcagg 

<210> 8 

<211> 33769 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> unsure 

<222> (33739) , (33749) , (33758) 

<223> Identity of nucleotide sequences at the above locations are 



28620 
28680 
28740 
28800 
28860 
28920 
28980 
29040 
29100 
29160 
29220 
29280 
29340 
29400 
29430 



<400> 8 
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cttcccctta cactggtcct 
gaggctctcg gtctctaagg 
ggcaacgtgg gggaacgagc 
gacggctcat tcggtgtcca 
gagggatggg gcgaaagaag 
acccagcggc ctgagccacc 
gtgtgcagga actcagaagg 
caggtttttt cacaacaaac 
ccccggacac ctccatccca 
cagagtggat gtcacaacct 
cccaggaaag gtccagtggc 
accaacccag gatagctgtg 
atccggggaa agctgagatc 
acccccagct ctgctcccag 
gggcttgagc cacctcctgg 
actgtccact gcgcagatcc 
gcaggcggcc agattatttt 
gggtaagtgg tggggcaaat 
gaccataggg tcacaaagca 
ctgaggccgg ccactgtcac 
tatggccctc caggagcagc 
cagctctaaa acaggaaagt 
agtcagtgcc ggcctcagga 
tgccagccag cccctcctgt 
acctcaggcc catggctgca 
atgttggcca ggctggtctc 
attacaggcg tgagccaccg 
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tcgacccgcc 


tcggatgaaa 


actgaatggg 


tttagcctta 


60 


gaggtgggtc 


aggatgccgg 


ggacagggtc 


ctcttcctgg 


120 


cacctacccc 


tccactgaat 


tgccctgggg 


tgtgggtacc 


180 


gggtctgaga 


tgtgttgaca 


ggaagaatga 


aaggggatgg 


240 


ccacctgcag 


ccccaggaac 


tatctggcca 


gcacaccgtc 


300 


cctgccagag 


ccaggaggag 


accctgccaa 


tgggtcacca 


360 


tcatcacagt 


taataccctc 


catgccccaa 


tgtgggaaaa 


420 


aagataattt 


ttgttatttt 


ggcaaaagga 


ggcagggcag 


480 


cctcatcacc 


cagccgcagg 


gccccggcca 


tccctgcaga 


540 


ccctgcaccg 


aaccaagtgc 


agctcccagg 


ccacaggcca 


600 


ccccggaggc 


tcccaccgca 


ggcctcccac 


cacagccggc 


660 


ttctcctggc 


ttcttttcac 


acgggtagca 


gaaagctgag 


720 


cagggaaagc 


tgagaatcgg 


cctctgctgc 


ccggacgccc 


780 


ctccagggcc 


tccttctcag 


gtgcccttac 


aggaggcaga 


840 


gcctggggca 


cgcaggatga 


acggggtcac 


ggtgcaggcc 


900 


caaggccata 


aacagcctgg 


ccacagtggc 


ttcccagctg 


960 


tgttgtttag 


caattgatta 


agtttctccg 


ctgcccccag 


1020 


gccgcaaccg 


cagcatttga 


cccgggatcc 


tgtgccaagt 


1080 


caagggaagt 


ggctgggccc 


gatgctggct 


ctgctggaac 


1140 


ctgcacggtg 


cctgggacct 


tccagcaagc 


acagagaagc 


1200 


tggcaggcac 


cttggcctgc 


agtcaggggc 


tctgtctgct 


1260 


cgctgctctg 


cctggggtca 


gggcagccag 


agagtgacca 


1320 


agggacctgc 


aggcgggtcc 


cttcctctcc 


catccctcgg 


1380 


ggccccccac 


tgcctgcctc 


tgcccccatg 


ccccaccaca 


1440 


tggccactcc 


ccaggcaggc 


agtggggatg 


ggatttcacc 


1500 


gaactcctga 


cctcaggtga 


ggagttccta 


aagtgctggg 


1560 


cgccagccct 


ccctgtggta 


ctaaacactc 


acaccccctt 


1620 
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gctggggacc ctggtgaggg 


aacacagcct 


cacaagtgaa 


gtgtggtttt 


gttgagcaaa 


1680 


tgacgcctgg gcagccctct 


catctttgcc 


taaaactgaa 


gaatttaggg 


gcgtggatgt 


1740 


ataaaacagt tggtgactta 


aatgaaaaag 


aaggccacac 


tccccccttt 


aggcaggcgg 


1800 


cctaattctt taaaagccag 


cacagggtgc 


ctttctgaac 


ccaggcacac 


agtaggtgtt 


1860 


caatggacag cagcggttac 


ttgtactgct 


catgacaccc 


tgtctgtggc 


ctctgcagct 


1920 


ggctccagcc tgacgcatgg 


ctgcgcccct 


ccgcaaggcc 


accccggtat 


acatggaaac 


1980 


tctgtggaga aggccttggg 


ggccggccag 


gacgccaggc 


ccagatccca 


tctgcgccct 


2040 


tcctccatag acctcagcga 


gctctcggca 


ccatgtgcct 


caggcccatt 


taagaagtag 


2100 


ggccggccag gcatggtggc 


tcatgcctgt 


aatcccagca 


ctttgggagg 


cccaaggtgg 


2160 


gtggatcacg agatggtcag 


gagatcgaga 


ccatcctggc 


taacacggtg 


aaaccccatc 


2220 


tctactaaaa atacaaaaaa 


taagccgagt 


gtggtggcgg 


gtgcctatag 


tccaagctac 


2280 


tcgggaggct gaggcaggat 


aatcgcttga 


gctcagcagg 


cagaggttgc 


agttagcgga 


2340 


gatcgcgcca ttgcactcca 


gcctaggtga 


cagagagaga 


ctctgtctca 


attaaaaaaa 


2400 


aaaaaaataa aaaaaagaag 


cagggccagc 


cacggacgac 


ccctcacaca 


gctcccagga 


2460 


cgcgtgcctg ggtatagggc 


tcaggaccat 


gaccgctgca 


gtggccccca 


agaaacgtta 


2520 


cttttgtcac ccaccccgcc 


tcagtggcag 


tagccaaaat 


aacggattag 


aatggaacca 


2580 


tgtgacaatg ccactgcccc 


aactgacaga 


agatggctat 


cagcagttca 


cgcggcccca 


2640 


cctatcacaa gtgcagggca 


ctctacaact 


tatgcatcct 


tccccagaca 


ccgtcctttc 


2700 


gaccctccca ggtcagcaag 


gcacacaggg 


cctacatttc 


acagccacac 


agcagagggc 


2760 


tgaggctgga actcggatgc 


tctgatttcc 


gttcaatcac 


atccccagag 


gtggcacaga 


2820 


gacggggggc ttctcttgac 


aaagtcaaga 


aagtcactgc 


cagctccact 


gaagaccaaa 


2880 


gaacctcagc tctcaaaccc 


tcttgaaggt 


gttaccgaac 


tctcccagcc 


tgtttcctgg 


2940 


gtcccgatgt tggtcccgtg 


ggacacagga 


agaggaagaa 


gctccctaga 


gcagagcctg 


3000 


gtgcacctgc cacactctca 


gagggctgcg 


cacgggcgga 


ggagccgtgt 


gcaggagtgg 


3060 


ggtctggatg gaggggcgct 


gtggccgggg 


gcagggggca 


ggggaagggt 


gctccaggtg 


3120 


gtgggcacag cacgagcagg 


ggcagggagg 


tccacactca 


gatgtgcaca 


gggagaaaca 


3180 


aatcgtgcat ttccattgga 


ataggcggta 


aaaggtagaa 


aaacagagtg 


ggggccagga 


3240 
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agggagtcgg 


agccttctag 


tgtctctctg 


caggtgagcg 


gcagcccgag 

ZJ ZJ ZJ — ' 


gtgtcagctc 


3300 


agcagacttg 


gggtccaggg 


gccgtgtctt 


ctatcactga 


ccccagggca 


cacggaactg 


3360 


gggagggaga 


gcagaggcac 


agggcacggt 


cagtgaaacg 


aaacaaggag 


tcatcaccaa 


3420 


atgcggaaag 


ggcaaggagt 


gcccgcagcc 


gcacaagggt 


tctgtctggg 


caacgtgggc 


3480 


gtcccaccag 


gccccgcacc 


ctgcaagcgc 


aaagctcgcc 


actgaagata 


aaqqqaagct 

ZJ ZJ ZJ j 


3540 


gttggagctg 


cggagctggt 


ctggggtccg 


catggagctg 

—J ) ZJ ZJ 


ggcttatgct 


gcagtcacaa 


3600 


gg999acatg 


gaagaggctg 


caggggacaa 


aaccagtgac 


cacagtctaa 


ctctgagcct 


3660 


gtggaaaggc 


gcccacagca 


ttcacccatc 


ccagagatgc 


cattccccct 


gtgcccccgc 


3720 


tccacggtga 


cagcgttctc 


caggaatatg 


atgcgcccct 


ctcctcttgc 


atcagccctg 


3780 


acagtgagta 


ttcaggccaa 


aaagcagaag 


agcacagctg 


cgtggttcca 


tttccatgta 


3840 


gttctggaac 


aggcaacgct 


aatccaaggt 


gatagaagtc 


aqqaaaqtqq 

ZJ ZJ ZJ ZJ ^ZJZJ 


tqqaqqqqqc 

^ZJZJ^ZJZJZJZJZJ^ 


3900 


99999 ttgag 

ZJ ZJ ZJ ZJ ZJ ZJ ZJ 


gatggcaaag 


qqgcaccqqq 

ZJZJZJ ^7^-3 


aactttccca 


gtggtagaaa 


tgttctctgt 


3960 


ctggaccgtg 


tggtagttat 


gcagacatat 


gcagctgtca 


aagttaatcc 


aaatgtacac 


4020 


gttaaaatgt 


gtgcgtttta 


ttgcctgcaa 


gttatacctc 


aattaaaaaa 


ataaagttag 


4080 


cactcaggct 


tcttccacaa 


cttcctgaac 


cgtgtgagct 


gattttcttg 


ctattaaaaa 


4140 


ttcacggtcc 


atggctgaga 


acagcagctg 


ccttctgttt 


gcaaagtcaa 


cgccaatcac 


4200 


tgcccggccg 


cggcagactc 


ggccccacag 


gacctccttt 


cttttttccc 


tttgacctac 


4260 


ttccctgata 


agtgacaaga 


cagccagact 


ctgggaacaa 


acgcccgtta 


ttcggccccg 


4320 


agctgagcgg 


gccctgcttc 


ctgagctaat 


ccgcccggac 


agacggaggg 

ZJ ZJ ZJ ZJ ZJ ZJ 


acgtgaqgqg 

ZJ ZJ — J ZJ ZJ ZJ 


4380 


ctttgccgtc 


ggctccagct 


gtcagtctgc 


ccgtcagact 


cgacagtggc 


cccctctgtt 


4440 


cctcccgctg 


cccccactcc 


atccccgact 


tctttttgtt 


tcctgtccct 


gacagacgaa 


4500 


catctgttaa 


aactctgtct 


gggtgagctg 


tqqccaqcqq 

ZJ ZJ ZJ ZJ ZJ 


cccacaaatc 


cccaagccgc 


4560 


accccagcct 


catctqqqcq 


ctqccqqqaq 


cactgcctgg 


ccaccctctg 


gacatagctc 


4620 


tgagagccac 


cggccagggc 


acgtgtggcc 


cgagtggcat 


ggtgcacgcc 


gctaagccca 


4680 


ctgcccaaag 


gcccccaagc 


aggagggatg 


tgcaggagac 


aaaagtcaaa 


agaacagggg 


4740 


cacgttccac 


agaggatggg 


gctggagggg 


tggcagtgag 


gaacagcagc 


ttccgaggat 


4800 


ggcggtggca 


actcccaaat 


aaggcctcac 


tcctgctgtt 


tttagctcat 


tccacataat 


4860 
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tggaaaaaca 


tggcagaaac 


cgaagccagc 


tgctgccttg 


gtcctggggc 


tgtgtggagg 


4920 


gggtggggag 


gccggaggcc 


caggctctgc 


actcgactgc 


tggggatgag 


agtgactctg 


4980 


agctgcagag 


agcagcatcg 


cagccgccat 


ggtcccattg 


agccccggcc 


acgctgggcg 


5040 


gcagaggctc 


gtgggatata 


cctgccctgt 


ctcatggggg 


tcacttcagg 


aggggcgggg 


5100 


gagccaggac 


acagcccagg 


gctagcggtc 


accctgcagc 


tcaggggcca 


cgtaaatagt 


5160 


gccaccttga 


aggcacacag 


cagtgcgggg 


ccccccccgc 


caccaacgca 


tccctacctc 


5220 


taggaggccg 


cctgtgtgcc 


cctgggaacg 


ctgctccctg 


tcccttgggg 


tcctggtgtg 


5280 


accaccctct 


cagccccttc 


cttggggaag 


gcacctgact 


ccctacaccc 


agctggcttt 


5340 


catttgctca 


aaatcaggaa 


aaagcagaat 


tcaagacatc 


acagaaatgt 


cttcgcctgt 


5400 


aactccatga 


aagataaacg 


gtcagacacc 


caggagggag 


tcccagggac 


ccttgagtct 


5460 


cacctgaggc 


tctggcttca 


aacctcgaga 


tgtttccagc 


catgctagcg 


ccgcccccca 


5520 


caacctgccc 


cacacagtcc 


tcccttggga 


actcacagat 


ttggccccca 


cctgccccgt 


5580 


ttcttctggt 


ggagtgggtg 


cgttgggttg 


gggtggggct 


ggggactctg 


gatgtgtctt 


5640 


aagagtctga 


gtgattctga 


cacagccagg 


ccctgccccc 


ctcctgacct 


tcgccccaca 


5700 


ggaaagggag 


ccacacgcct 


gaagcgccca 


gcacaccccc 


ctccgtcctc 


cccaggtcac 


5760 


ccgctggccg 


tgtgagccgt 


gctccccact 


gccccttcac 


ccaccccagc 


tcctcctggc 


5820 


agcacccagc 


cttggaagct 


acttctgatt 


acaaccgccg 


aaggaagact 


cgctccctcg 


5880 


gcactgaccc 


agacagcctg 


caccatcacg 


ctgctcagca 


caacccacac 


agccttcctc 


5940 


caaaccccat 


ggagcgggga 


gtataatcac 


cccctttcta 


ccaacggaca 


aactgaagca 


6000 


cagagaggtt 


aagtcactfct 


cctaagctcc 


caacacgatg 


acaaaaaata 


gaaggtcagc 


6060 


ccgcaagtgg 


aactaggtgc 


tccaagtccc 


cggtctgcct 


gacactgcac 


ctcctcgccg 


6120 


ccacggtccc 


gggtccgcct 


gacactgcac 


ctcctcgccg 


ccacggtccc 


gggtccgcct 


6180 


gacactgcac 


ctcctcgccg 


ccacggtccc 


gggtccgcct 


gacactgcac 


ctcctcgccg 


6240 


ccacggtccc 


gggtccgcct 


gacactgcac 


ctcctcgccg 


ccacggtccc 


gggtccgcct 


6300 


gacactgcac 


ctcctcgccg 


ccacggtccc 


gggtccgcct 


gacactgcac 


ctcctcgccg 


6360 


ccacggtccc 


gggtctgcct 


gacactgcac 


ctcctcaaca 


ccaccacggt 


cccgggtctg 


6420 


cctgacactg 


cacctcctca 


ccaccaccac 


agtcccgggt 


ctgcctgaca 


ctgcatttcc 


6480 
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tcatcaccac 


agtcccgggt 


ctgcctgaca 


ctgcatttcc 


tcatcaccac 


ggtcccgggt 


6540 


ctgcctgaca 


ctgcacctcc 


tcaccgccac 


ggtcccgggt 


ctgcctgaca 


ctgcactttc 


6600 


tcaacaccac 


fcccttggccg 


gctcccaact 


acaaaccaag 


ccatgtcttc 


catcctgaat 


6660 


cctcttggcc 


taaacatcac 


tcacaatgcc 


tccctcggga 


acaggcacaa 


gtcccaccag 


6720 


cacagcctcc 


ttcgttacct 


gcgtttccgc 


tagcccaggg 


ccagctccag 


agccctcacc 


6780 


acagagcctc 


tatccttcac 


ccccggacac' 


tggacctcac 


caacccatag 


cctggaggag 


6840 


atccctgtgt 


gaccccaggg 


cctcctctgc 


ccgactctga 


atttcactgc 


ccaacgtgac 


6900 


acctcggaag 


gctctctggg 


cactggcagc 


cctccatggg 


caccgctcct 


tctggccagc 


6960 


tctgacatcc 


cqqctqqtqa 

-3 3 3 3 


qqtqccctqc 

3 3 ~3 3 


acgaggcctc 


tgcccactgg 


gacctcacag 


7020 


CCQtCfCtQtC 


agctgcaaca 


agcgacagaa 


tttcacgttt 


tcttcacgtt 


qcccctqggt 


7080 


gagcagctcc 


aggtagtttt 


cagtcgaggc 


qaqqcqtccc 


gtcagcagcc 


aggcggcaca 


7140 


gctaattcat 


gcccgccggg 


cgcacggccg 


caataccaat 


gggcacctgc 


agcctggaaa 


7200 


gccacagagg 


aaccgagaac 


aqcqactqtq 


ctcaggtgac 


aggactqtqq 

~J ZD ZD ZD ZD 


tcttttaaca 


7260 


aaacattttc 


ctttaacgtg 


atattttacg 


qcaaqqaatq 

3^^ 33 3 


aaacctggag 


ggcaggacat 


7320 


ttggatacta 


aaaccccacrcf 




Qtctqctttq 

3 3 3 


tgaagtctga 


agcccgcgcc 


7380 


ccattctggc 


cccgctcaca 


qqtccqqctc 

3 3 ^ w 33 w ^ 


tgactcacca 


gcttcaatgc 


taqqccqtqc 


7440 


ctgtcctcca 


accagaacat 


gacttcctta 


aggacaaagc 


cgtttctcgc 


ccatccccat 


7500 


ctccctctgg 


attaagaaat 


atgggaagat 


cttctagaac 


cacctcaaat 


ttgcagagag 


7560 


ccatcctggt 


gacaaaccct 


tgaaatgctt 


ctaagaagag 


tttaggtttc 


ttctcaactc 


7620 


taaaacctct 


agaaaactct 


atttccacac 


cagctgcccc 


tggaacactt 


cagcttcaaa 


7680 


aqqqcccaqq 


gcagggagac 


qqagqaqcca 


gcatccacac 


cgagcaccag 


cctgttaatt 


7740 


aacqqqaaqc 

3 3 3 


qqqtqqqqcc 


catctccagg 


cagctctgag 


gtcaqactqq 


ggaaccatgc 


7800 


ttacaaaaaa 


aagtgaactg 


aaacgctcac 


gtcctcatgc 


aaaaccagac 


tcccagttgc 


7860 


atctttctgt 


ctcattgagg 


agctttttcc 


tccctttgac 


agaacaccct 


acacacggca 


7920 


tctggaacca 


aagcagaaag 


attcaggctc 


agagtaaaac 


agtccccaca 


ctggctgcat 


7980 


gtggacgttc 


ccggcccaga 


gtctcgccca 


agcagggcct 


ataaatgaca 


caaaatgttt 


8040 


ttctcctgcg 


tgccagtcat 


gctccaactg 


agttatgtgt 


aaaagtgcct 


ctcacggctg 


8100 
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agggcaaaaa 


cagttcccac 


aagactagag 


aaaggtgacc 


cctgacggct 


gagtctctag 


8160 


ggagcgtgga 


gctgcgtgct 


cagccctgcg 


gccctgacgg 


ctctggaatg 


gaaaagctat 


8220 


ccaactggaa 


gggcagggct 


cgctgctagt 


ccagcggtcc 


aaccccacag 


gtgtctgtgq 

ZJ ZD ZJ ZJ ZD 


8280 


tgtcagctcc 


atgccacaga 


gcccagggct 


ggggccagag 


ccaccaggcc 


ccctgccagc 


8340 


ctgcaggggc 


ctcctcctct 


gggtagccta 


accaccccct 


gtgagcgcag 

■—> ~J ZJ ZJ ZJ 


gcagcctcct 


8400 


ctaatcacca 


cagggcctgt 


ccccccctct 


cccccgcttg 


caggaaaatg 


aaccctaacrcr 

" ZJ ZJ ZJ> 


8460 


actccccagg gctgctctgg gcctggacat 


ggagactggg 


aattacattt 


qcaqaaqqacr 

ZJ ZJ zjzj ZJ 


8520 


cgcaatgccc 


ttgaagggct 


cagccacgag 


cagccagtcc 


ccagggctca 


gaaggcccag 


8580 


ctgttagaac 


cctgggagcc 


agcaaagagc 


caggggctcc 


acctaagtct 


atagcccctg 


8640 


cctcttctgg 


ttgggaaaga 


aatcaacgcc 


cctttactgg 


ctcccactga 


cagcccactc 


8700 


ccccaggtat 


gggaggattc 


tgggacgatg 


caggcaaacc 


tggaccctga gtgaacctgc 


8760 


cccagctctc 


acgggcctgg 


caccagccac 


agcacctaag 


gcgccggtca 


tggtgacaac 


8820 


atgaaggtga 


taagggcatg 


gacagtggac 


atggcagctg 


gacactgggc 


acccactgga 


8880 


tgccaggcac 


ccagcacggc 


tccgtcaccc 


ctggatgagc 


agtggccctt 


tgcaagccag 


8940 


ggtagcctgg 


gcaagttatt 


tgggggtctc 


caagcttgtc 


cagctgtgcg acttcactga 


9000 


gccatgagtc 


tgggatttta 


tcagggccca 


cacccgttcc 


tggaactctg 


atacgtgagg 


9060 


gagccacaca 


gggaccctta 


acaaaagctc 


ccagggcaac 


atgttctctt 


gcctcagtct 


9120 


cccaaatagc 


tgggattaca 


ggcgcacgac 


taccgcccgg 


ctaatttttg 


tatttttagt 


9180 


agagacaggg 


tttcaccatg 


ttggccaggc 


tggtcttgaa 


cccctgacct 


caaatgatcc 


9240 


ttccactgtt 


agggcaaggc 


acctgacagg 


cacgactgca 

ZJ ZJ 


cgatctgctt 


gttgggggct 


9300 


gtgtccattc 


cccactcctt 


cgacaaatgt 


ccacacccag 


ccttgctttg 


acaccccaag 


9360 


aacagagatg 


gtgacacctg 


cttcctacat 


gcccattgct 


ctcccaaggc 


agacatcccc 


9420 


agcagatgca 


acacagtgtt 


taggcagaca 


tcaccaatcg 


atggtggcaa 


cagacaccag 


9480 


gccctgctcc 


ctctaactcc 


agtggccagg 


ccccaagcca 


gctctcacct 


gcccactccc 


9540 


aacccacagc 


agcaagactc 


agaaatggca 


aaaacacaaa 


gagaacagaa 


acgccccata 


9600 


gcgggaggat 


gactaaaaga 


catgtcttga 


taagatattg 


ttcaggcata ggccaggcac 


9660 


agtggctcat 


gcctgtgatc 


ctagaacttt 


aggaggctga 


ggtaggtgga 


tcacctgagg 


9720 
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ttaggagttc aagaccagcc tagccaacat ggtgaaaccc catctctact aaacatacaa 
aaattagcca gacatagtag cgggcgcctg taatcccagc tgcttgggag gctgaggcag 
gagaattgct tgaacctggg aggtggaagc tgctgtgagc cactgtactc caacctggac 
aacagagcaa gactctgtct caaaaaaaaa aaaaaaaaaa gatatccttc actaaaactc 
atgtctttga tacatattta cctcctgcaa tcgcaaatgc ttctgcagtg cataaagtga 
aataaatagc aggaagcctt acggttcgat cacccacaca gacacacagt cacatacagg 



aaaaacgcag 


ggagggctgg 


ggaacaaaaa 


aacagaagat 


aaaatgtgga 


gacagacaca 


ccaagagagt 


aagagaccac 


ctccagacct 


cccttcagct 


tctcaaacac 


acgagccggg 


cccgttacag 


aatttgcggg 


gaccgctgca 


aaatggaagt 


gcagacagcc 


ccttactcaa 


aaggtaggaa 


tttcaggtca 


acaacagagc 


tcacctcata 


tgactacaca 


ggtcacacag 


cccgtgaagt 


cggtcccaac 


accagcatgc 


tcctgcctca 


aagccgctgc 


acgtgctgtt 


ccttctcgcc 


tttccctctt 


ttagtccttc 


agatctcagg 


cctcctgaga 


gagacctctg 


acctgccggc 


tcaggcggcc 


acacccccag 


tacaggagtc 


tccggctcag 


cccctgctgt 


gttccgtacc 


cgatccaggt 


ctgtcctatg 


tccatctgtg 


tgccggcttg 


cttcctgaca 


tggcccccac 


cacacgtgtg 


cctcggggca 


ggggaacagg 


cccgtctcat 


taactgcttt 


cttctcagat 


attttctgga 


atatttgtgg 


atattgggca 


acatatatgc 


tccacctttt 


tcagactagc 


caggacgagc 


tgcatttttt 


tttttttttt 


tttgagacag 


ggtctcactc 


tgttgcccag 


gctggagtat 


agcggcatga 


tcttggctca 


gtgcaacctc 


cgcctcctag 


gctcaagcaa 


ttctcctgcc 


tcagtctccc 


aagtagctgg 


gattacaggc 


ccgtgccact 


actgcccagc 


taatttttat 


atttttagta 


gagatggagt 


ttcaccatgt 


tggccaggct 


ggtcttgaac 


tcctgacctc 


aaatgatcca 


cctgccttgg 


actcccaaat 


tgttgggatt 


acaggcgtga 


gccactgcgc 


ccggcccgag 


ctgcctgttt 


tacacctttg 


ccatattccg 


gtgattctct 


ctcccctccg 


tcccccggcc 


ctgactgtgg 


tggccactcc 


ctgccgtcat 


gagcccgtat 


gtcctcactc 


tttccctttc 


cgccaggact 


tcaaccaaca 


ctgcagagcg 


cagggtccag 


ctccagcact 


gagttcagcc 


tcttctcacc 


aacagacagg 


caggaaagaa 


aacaaactct 


gagaaggcca 


aggttcccgg 


gcagccagca 


agccaagcat 


ccttctccgc 



tgaggcttgt gcagccgagg caccccctcc tccagggagc aggcagcgtc ctggggcagt 
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ctgcgaggga gaccagggcc cttgctccac cagggcccca ggtatggggg cagcagcaaa 
ctcatggctc tgggagccag accccacctg ctagaaccta ctatgccacc tgctgtgggc 
aaccccaggc tggtgacttg ccctggcctc ctctgtaaac aaagggctca tccaacctgg 
tcaaaccact cctccccttc aagggtctat aatcctccct taacctgctt ggtccaaacc 
cctggtgtcg ccaggtcact caggaggcag ctcatctgga ctccttccct gggtccagtt 
tctctctcaa cattgccttt gaggccgagg tgaacggtca acagcgaagg gccccagagg 
tgatggagga gcgggtgtcc aagacactca ccctttctaa tgcactgact ccctcgtgga 
ctcacttgtg ccgtctcccc cacccaccca gccccagagc ccagagtgcg agcgccagag 
gcccgggatt ctgtctgcac cgcggggtcc ccagtgcctc ggagcaatgc cagcacccgg 
caagtgttcg acaaatgcct gctgaatgag caaatggatg gatgaacgaa tgaatgagca 
agcagatgaa tgaatggggt gctgtccaga gccgtgagga ctaggccgcc caagtcccca 
tttctcaaat tctccttctc ccgacttggg aaacaagatg cttggtcggg gaggctctcc 
aaccatcccc tgcagcagcc ggcacagcgg acagaccctt tgatgtaaca gccatgtctt 
cattaaagat gccctgctct cagaaagaga aagacaaata caaacctgga aaatcctcac 
caaacgcagg acccctgcca gggagcagag aaaagaccca cacgccacgg gcgccacgac 
cacacacaca ccccagccgc tgcacacaaa cacagaccct agccagcaag aacaggggga 
ccaggaaact gttcctaaag tcaggacccc catgtgctca gacagcagtg agagcaagga 
cacttctcca tccaccggat gccaggagag tccttttagg gggccccaca ccgagactct 
gcccttagga ctgttcctga gtgtggaagc cagcccactt ggaagccccc tgccctcccg 
agtgggacac cggcacagga agcaggccct gtcccccacc actttctgca agctgggccc 
catcacgcta cagaaacggg gaggactggt cccagggatg gcgctttcct gacacctctc 
gttaccccct cgctfcgccag gccccagggt cagccccaga ggccagactg gctatcccag 
gcccgggagc atccccgaag gcgagctgca tcctgaacgt gtgtgatttc ccgaagggcc 
cgccccgaac cgacacctgg aaagaaagat cctcagccgg tgccccagag gagaagagcc 
atgcctcact gcaacacagt cccaggaagc accaagtgcc tgaggaccaa ggcggagagt 
aaaaaagtgg aaaatatctg gggcaaaaat aaaacaaaac aaaacaggat tgacctcctg 
ggctcaagca atcctcccaa ctcagcttcc cgagtagctg ggaccacaga cttgaatcac 
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cacacccgcc aagtggatca tttcgaacgg gtttgccgag gttccttctg gggcaccccc 13 02 0 

ggcggccgca acccattccc gccaggcccc gccccgcccg cccgccccgt cccgtcccac 13 080 

cgcctcacct gccttacacg tcctgccgtt gtcctgcagc tgcacacccg tggggcaggc 1314 0 

gcatgtgtag aaaggctcgc ttggggacag caggcacagg tgggagcagc cgccattgtc 13200 

ctcctcacag cgagtgtgga ctgagaaaac caggacagac tgagagaagg ttccagaaga 132 60 

ggaccgtcac ttgtttctga atgagtcaca tcctgcctcg tcccccgtga cagcctccag 1332 0 

tgtgtccctc tgcccaaaca tcggcctcaa gtggcatcag ggacctcccc gcgggcacca 13380 

ttccacctgc ctcatcgctg gccccgtcca catggggccc tcagcctggc cagacggcct 13440 

gcaatttccc caaaaccagc cgtgaccttc ctggccaccc tcacacccag atgtgacctg 13500 

cccatggagt gacatcctcc ccatctgctt cctcccacca agctcctatg actagaacac 13 560 

cctccccagc tcctcggagc ccccaaagga cacccctctg caaaggctgc cccccacgct 13620 

ccaatggccg gggtcaggac ctgcctgtgt ggtagtgacg ggaaccccag agacaatggg 13680 

ctcctgggca aaaggcttgt cttgtctttg tgctatgtgt ggacccagca gcttccatag 13740 

gaacactgtc cttcttgctg ggatggccaa gcttgtcact ctcccaagcc ctcctatgac 13800 

caacagcaat tgaacggaac tcgataaatg cttccagcac ctcattcaaa ccaggggaaa 13860 

gctgggtgta gcagccccaa aatacggata taactggaac aacaaactca tcaaaatgaa 13 920 

cctctccctc cctcatgctg ccccaagtgt agatgggttt tgtgaccacg actttctcac 13980 

caggaaacag ctccagagag ccccaccctc ctgtgtcctg ctctgggaac agctggcacc 1404 0 

cctaggcccc acatttcaat tcaaagtcca aaccttccat aatggcctgg ccagaaatct 14100 

ccatccctgg tccctgtggg agtgggccac tgtccccaga gccgcagccc cactgtcaca 14160 

gaagctggtg catttcccca tcagggacct ctgtcacaac ccagcgtggc ccccaggctg 1422 0 

agaactgctg attctgggca gattattcat tgataaatac gcgacttgca gggccaagca 142 80 

tggtggctca tacctgtgac cccagcactt tgggaagtca aggtgtgagg atcactggag 14340 

cccacgagtt tgagacaagc ctgggcaacg tggcaaaatc tctcatctct attaaaaata 144 0 0 

catacacaca cacacacaca cacacacaca cacatatata tgtatatata aataaccata 14460 

tatatatata cacacatacg tgtatgtgta tataaataca tatacacaca cacacagaca 14520 

acttcttctg ggccttgaaa acgaggcaac cttccttgga aatccccttg ccactgctga 145 80 
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gcctgaaata 


gcccccatga 


gctctgcaga 


ggggtcctct 


gcaggcccgt 


gtcccccagc 


14640 


cagccacaca 


cctccctcca 


ttgcagcagg 


taccccttta 


gagagggggc 


cccccagagc 


14700 


atgggcttct 


gcagggaggg 


gtcacctgcc 


cccccacccc 


acccacgccc 


gcgcaccccc 


14760 


acgcccccgc 


atcctcccac 


tcccctgccc 


cgcgcccccg 


ctccccccag 


ccccctcacc 


14820 


ctctcccccg 


tgccccaacc 


ggcactcaca 


aaaaggctgc 


cgctcctggc 


tcagcacctg 


14880 


gatgtccatg 


ggtgagtata 


gggcactcag 


gatctccttc 


ctcttccccc 


cagtgcgctt 


14940 


gttgcaggca 


tggatggagc 


gggtctgcca 


gtctgtccag 


tacagagtgt 


ccccggagag 


15000 


cgtcagggcg 


aaggggtgcg 


tcaggctgcc 


ctccaccacc 


ttctgcctgc 


agtcagggaa 


15060 


gcggggtgga 


ggagccatca 


ggagggtccc 


ccgacagtca 


ttgctgctga 


cccaattaat 


15120 


ttcttttttt 


ttttttgaga 


tggagtctcg 


gtctgtcgcc 


caggctggag 


tgcagtgatg 


15180 


taatctcagc 


tcactgcaac 


ctccgcctcc 


cgggttcaag 


caattatcct 


gcctcagcct 


15240 


cccgagtagc 


tgggatcact 


gatgcccacc 


actacgccca 


gatgattttt 


gtatttttag 


15300 


tagagacagg 


gtttcatcat 


gttggcaagg 


ctggtctcga 


actcctgacc 


tcaggtgatc 


15360 


cacccacctc 


agcctctcaa 


agcgctggga 


ttacaggcgt 


gcgccaccat 


gccaggcttc 


15420 


ccatttgctt 


tcaaccagac 


aagtgaggcc 


aggtcaagag 


ccccaggagc 


tggcgccctc 


15480 


gtacatttct 


cccggcgtgc 


acagggcacc 


tcccaaacac 


agcctgtgat 


ggtgacacac 


15540 


gggctccccc 


aggtcaagtg 


gcaaagtctc 


ccccagggaa 


gaaaggagga 


agccatgcct 


15600 


ggcaaaaagc 


acacctctcc 


tgcccaacgc 


tttaacctct 


gtatacaaat 


caggccatgt 


15660 


gcactcgctc 


cttcttacaa 


tgctcataat 


ttatactttc 


agagtaaatg 


aaacttggca 


15720 


tcaacccgag 


aaacagctat 


tcttttctag 


atgcttacag 


tgcccagcaa 


atgaggactc 


15780 


gggtgtaatg 


agattatgga 


cactggaaac 


aggatcataa 


tgtgacgtgg 


tcggtaatgt 


15840 


gcagttttat 


ttgcttaatg 


accctcgccc 


cgtgacaggc 


tccctgaggg 


tgggcctggg 


15900 


ggcagaggtc 


cccgccacgt 


ccccagccct 


cagcacagtt 


gccaggagag 


ggtgacactc 


15960 


atgaagtggc 


acagggaaga 


tgggagctgt 


gggctctgca 


gatccaccac 


ctcttctgtt 


16020 


catttttgtt 


gatgctgttt 


tttaagaaaa 


ttattgaagt 


aaaattcaca 


ggacatacgt 


16080 


ttactttttt 


tttttttttt 


ggagatgggg 


tctcactctg 


tcacccaggt 


tggagtgcag 


16140 


tggtgtgatc 


tcagctcact 


gcaacctctg 


cctcccaggt 


tcaagcgatt 


ctcccacctc 


16200 



BNSDOCID: <WO 0177327A1_I_> 



WO 01/77327 


• 


89 




• 

PCTYUS00/16951 


cgcctccaga 


gtagctggga 


ccacaggcgt 


gcaccaccac 


acccagctaa 


tttttggggg 


16260 


gtatcttttt 


ggtagagaca 


gggtttcgcc 


atgttgccca 


aggctggtct 


tgaagccctg 


16320 


agctcaggcg 


atccacccgc 


cttggcctct 


caaagtgctg 


ggattacagg 


cataagccac 


16380 


tgcacccagc 


ctaaatttac 


cactttaaag 


tgaatagtgt 


tacctagtgc 


attcgcaagg 


16440 


cggtgcagcc 


tccacttctg 


tctagttcca 


aagcacttcc 


attgccccac 


aggcaaaccc 


16500 


cacacccggc 


agcagtcatg 


ccccagtccc 


cgcccccagc 


cccggcaaac 


acttttgatg 


16560 


gacttaacta 


cacacattct 


caacatctca 


tataaacgga 


atcacaatat 


acagcctctg 


16620 


atgtctgtct 


tctttgactt 


ggcaccatgt 


tttcgaggtt 


catccaggct 


gtagcatgtc 


16680 


agtgcttcat 


cccgttttag 


gggtgaacca 


tattccagtg 


tgcagacaga 


aaccaatctg 


16740 


tgcatccatt 


cacccactgg 


gggacctttg 


tgtcatttcc 


accctcggct 


gttgtgcaca 


16800 


gtgctgctac 


ggacattact 


gtccattcac 


attttgtgtg 


aagacctgtt 


ttcgattctt 


16860 


aagagtatac 


agctaggagc 


ggaattgctg 


ggtcatacgt 


aaatcaatgt 


ttacgtctca 


16920 


aggaatcaac 


aaactgtttt 


ccacaatgtt 


gtcttttttg 


tttgttttct 


gagacagggt 


16980 


cttgctctgt 


cacccaggct 


ggagtgcggt 


ggtgtgatca 


tggctcactg 


cagcctcaat 


17040 


ctcctaagct 


caatccatcc 


tcctgcctca 


gcctcctgag 


tagctgggaa 


cacaggtatg 


17100 


taccaccatg 


gccagctaat 


tttctaattt 


tatttttttt 


tgtttttgtt 


tttttgagac 


17160 


agagtctcgc 


tctgtcgccc 


aggctggagt 


gcagtggtgc 


catctcagct 


cactgcaagc 


17220 


tctgcctccc 


gggttcacac 


cattctcctg 


cctcagcctc 


ccgagtggct 


gggactatag 


17280 


tcaccggcca 


ccacgcctgg 


ctaatttttt 


tgtattttta 


gtagagatgg 


ggtttcaccg 


17340 


tgttacccag 


gatggtctcg 


atctcctaac 


ttcatgatcc 


acctgccttg 


gcctcccaaa 


17400 


gttctgggat 


tacaggcgtg 


agccaccacg 


cccgacctta 


cttttaattt 


tttaatttta 


17460 


ttattttatt 


ttattttttt 


tttttttgag 


acagagtctc 


gctctgtagc 


ccaggctgga 


17520 


gtgcagtggc 


gggatctcag 


ctcactgcaa 


gctccacctc 


ccaggttcac 


gccattctcc 


17580 


tgcctcagcc 


tcccgagtag 


ctgggactac 


aggtgcccac 


cacgatgccc 


ggctaatttt 


17640 


ttgtattttt 


agtagagaca 


gggtttcact 


gtgttagcca 


ggatgatctc 


aatctcctga 


17700 


cctcgtgatc 


cgcccgtctc 


agcctcccaa 


agtgctggga 


ttacaggcgt 


gagccaccgc 


17760 


gcccagcctt 


tttttttttt 


tttttttttt 


ttttgagata 


gagtcttgct 


ctgtcgccca 


17820 
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ggctggagtg cagtggcggg atctcagctc actgcaagct ccgcctccca ggttcacgcc 
attctcctgc ctcagcctcc cgagtagctg ggactacagg cacccaccac cacacctggc 
taatgttttg tatttttagt agagacgagg tttcaccgtg ttagccagga tggtctcgat 
ctcctgacct cgtaatccgc ccgcctcggc ctcccaaagt gctgggatta cacgcgtaag 
ccatggcgcc cagcccatgt ggccattttt cagtgagaga agccagaggc ccatcactct 
cggttgctcc ctgggccatg ctctgcctca gccagaagca ctgagggaag gtcagcctcg 
gcccttgccc cagccacagt cacagataaa ggggcctgca caggtctgtg tggctccaga 
gctcgtcacc caacacacga cgcttccatg tgaatagccc caggtgcatc atgaagagcg 
atggccgctg cagaggcaga agaatcccgc ggggaagcag gtgggagaga ggctgagaac 
agaccagacc ctggagctac agaccctatg ttccaaccct ggctgggact agctgtgtgg 
ctctgggcaa attcacatgc ttctctgtgc acaggggatc aaaatagcaa acacaggcta 
ggcacagtgg ttcacaccta taatcccagt gctttgagag gccgaggtgg acacatggct 
taagctcagg agtttgagac cagcctgggc aacatggtga aacctcgtct ctacaaaaaa 
aataccaaat aaattagcca ggcgtggtgg tacgtgcctg tggtctcagc tacttggaag 
gctgaggcgg gaggaacact tgagcccaag aagtcaaggc tgtggccgcg tgtggtggct 
cacgcctgta atcccagcac tttgagaggc tcaggtgggt ggatcacttg tgatcaggag 
ttcaagacca gcctggccaa catggtgaaa ccccgtccct actaaaaaaa tacaacaatt 
tgccaggcgt ggtggcgggc acctgtaatc ccagctactt gggaggctga ggcaggagaa 
tagttagaac ttgggaggtg gaggttgtag ttagccaaga tggtgccgct gcactccagc 
cagggggaca gagcaagact ccatcccaaa aaaaaaaaaa acaaacaaac aaacaaaaaa 
agaggtcaag gctgcagtga accatgattg tgccaatgca ctccagcctg ggtgacaaag 
tgagaccctg cctcaaaaca ataaaaatat aaataaaaat aaaacataat agcaaacgtt 
tcatagaggt ggtatgagca ttaaatgaac tgataaacgt ccctggaaaa cagtaagtgc 
tatggaagga ttcgctgccg ccaccgccac caccattagc atgtttcaac ctccatcacc 
ctcactgtcc cctgtcacca tcctttgacc agggcactcc cagctgcagc ctttctatcc 
tcttgtccac ccttcataac tgtaagatca ctcagctccc aagaaccaca gtctacaggg 
taaccacatt tccaaatctc aaaccagacc cgctggtctg cacttccagg gacaacagga 
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tattttcaaa 


ccagcccaaa 


agagatgtgt 


ggctcagcat 


aagaggaaca 


ggagaaactg 


19500 


aggcctcttg 


ccctgagaat 


gagcttggaa 


gtggatgtcc 


CygCCtCaCt 


caaaccttca 


19560 


gatgactgag 


gcccagccag 


gagcttgagt 


gtaccctcag 


gtcataccct 


gagccagaag 


19620 


cacccagcta 


atccactcct 


catcactgac 


tccctcccca 


taaaaaacct 


gtttgctgtt 


19680 


tcaggctgtt 


aagttgtggg 


ctgttttgtt 


acacagcaat 


ggataactaa 


cacacgaggc 


19740 


ctggcaagtg 


tggagcaaag 


ctgcccaagc 


cctcaagtct 


gttcafcgfcgg 


gtgttggcct 


19800 


gtgtttgcag 


aaatccagcc 


ac tgagtcct 


cccatgcagt 


cactactgcc 


ctctgcacag 


19860 


acacctgcca 


catccctgcc 


tgggccagga 


gctccactag 


tgcaggaatg 


gggtctgccg 


19920 


tcccaggagg 


atccctgaca 


cctagcacag 


ggctagcagc 


aggcagcact 


tggttagtga 


19980 


ataaactgcc 


cttcacctgt 


acacagaagg 


gatgtttcta 


taaggggtaa 


ttaagtacag 


20040 


agctgggaag 


ctatgctgac 


cagaaggctc 


taaaagcaat 


taaccaacga 


ggggaaaacc 


20100 


cttcctactc 


attctcggcc 


catttitattg 


agcactgacc 


atgtggaagg 


ccccctggtg 


20160 


agactgggga 


atgcaccaat 


aactgagaca 


gcttccggct 


yULyCCC UCa 


ggatgcctga 


20220 


gctgggatag 


ggccagggtg 


ggggtggtgc 


gtgtgacagg 


gttactgttc 


acaaccctgc 


20280 


cgggccataa 


gccctcccca 


acaattccaa 


aatccaaaac 


gctctgaaga 


tggaaagctt 


20340 


ttgttgctca 


tctggtgaca 


aaacctcatt 


tggtgcatgg 


gccgggtgcg 


gtggctcacg 


20400 


cctgtaatcc 


cagcactctg 


ggagccgagg 


ggaaggatcc 


cttgagctta 


ggagtttgag 


20460 


accagcctga 


gcaacatgtg 


agaccccgtc 


tctaccaaaa 


atacaaaaat 


tagccaggtg 


20520 


tggtggcgca 


ctcctgtagt 


cccagctact 


cgggaggctg 


aggcgggagg 


atcgcttgag 


20580 


cctgggaggt 


gggggctgca 


gtgagctgag 


attatgacat 


tgcactccag 


cctgggtgaa 


20640 


agagtgagac 


tctgtctcaa 


aaaaacaaag 


ttaaaaaaaa 


aaaaactgtg 


catgggtgtg 


20700 


ggctacagat 


agtcttttct 


gccctactta 


gaatgaacgt 


gccacatttg 


ctatagaaat 


20760 


attcaagggc 


tggtggcaaa 


tgccacacag 


accctgacgc 


tgttccaagt 


tctgagaagt 


20820 


cctgcattcc 


tcagggcccc 


agagfcfcfccag 


agaagagfccfc 


gtaggcctga 


gttaagaagg 


20880 


aacgccttca 


aaagccctgg 


ggacaaaggg 


gaaaggggtg 


ccccaggact 


gcgtgggtac 


20940 


ctaccggaac 


gagccgtcca 


ggttggcacg 


gtggatgaag 


ctgagcttgg 


cgtcagccca 


21000 


gtagagcttc 


tgctcctcca 


ggtcgatggt 


cagtccattg 


ggccagtaaa 


tgtccgagtc 


21060 
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cacaatgatc 


ttccgggtgc 


tgccatccat 


ccctgcccgc 


tcaatccggg gcgtctcacc 


21120 


ccagtctgtc 


cagtacatgt 


acctgtgacg 


ggggcagggc 


aagagaagca 


gctaacacag 


21180 


atctgttttt 


tgtttttgtc 


tgcatagatg 


cagacatgaa 


acaacagaca 


gtgaacttgc 


21240 


cctaaaatct 


cacccatcgg 


aaataaccaa 


caggtatggt 


ttcaggtatt 


cctgccttaa 


21300 


gctgggcaat 


caaaatatac 


tatttccaac 


ttgttctcag 


ttaacagtaa 


attctgggca 


21360 


ccttcccttc 


ttgtggatag 


aaagattcct 


tgttcttttg 


atgattgcct 


agtgtactct 


21420 


gctgtaagtt 


ttttaaagaa 


cttcaggtta 


tttctgattt 


ttttgctacc 


atgaaaatgc 


21480 


tgtaaatgaa 


cctctaaaag 


gcaattcaaa 


acactcagga 


tggaatatta 


tttagtggta 


21540 


taaagaaatg 


agctatcggc 


tgggcccagt 


ggctcacacc 


tctaatccca 


gcactttggg 


21600 


aggccaaggc 


gggtggatca 


cgaggtcggg 


agatcaagac 


catcctggct 


aacacagtga 


21660 


aaccccgtcc 


ctactaaaaa 


tacaaaacat 


tagccaggcg 


tggtagtgag 


cacctgtagt 


21720 


cccagctact 


taggaggctg 


aggcaggaga 


atcatttgaa 


cccgggaggg 


ggaggttgca 


21780 


gtgagcagaa 


atcgcaccat 


tgcactccat 


cctgggcgac 


agagcgagac 


tccatctcaa 


21840 


aaaaaaaaaa 


aagaaaagaa 


aagaaatgat 


ctatcaagcc 


atgaaaagac 


atggaggaaa 


21900 


cttaaatgca 


tgttagtagg 


tgaaagagcc 


aatctgtatg 


agtccagttc 


taaacactct 


21960 


ggaaaaagca 


aatacacaga 


gacagtaaag 


catcagtggt 


tgccaggagt 


tggagaggag 


22020 


agggatgaat 


gagtggagca 


cagaaaatca 


gggcagtgga 

ZD —J ZD ZJ ZJ ZD 


actatcctgt 


atgacatgga 


22080 


atggtgggtg 


catgtcctta 


ctcatctgtc 


taaaccaaga 


atgtacaaat 


caagggcgaa 


22140 


ccctcgtgta 


aacgtggatt 


ttgggtgatg 


gtgcgtcagc 


cagctttcat 


cagttgtaac 


22200 


aaatgtacca 


ccctgcacag 


gatgctgaca 


gttgggaagg 


ctgtgtgggt 


gtgaggacag 


22260 


ggatgtatag 


gaactcagta 


cctgctgctc 


atcaattttg 


ctgtgaacct 


acaactgttt 


22320 


gaaaaaatta 


agtctattta 


aaaacaacaa 


aacatggcca 


ggcacgatgg 


cttgcacctg 


22380 


taattccagt 


acttcgggag 


gctgaggtgg 


gtgggtcact 


tgagccaccc 


tgggcaacat 


22440 


ggcaaaatcc 


cacctctaca 


aaaaataaaa 


attaaaaaaa 


agttagctgg 


gcatggtggc 


22500 


acactcttgt 


agtcccagct 


acttgggagg 


ctgacgtggg 


aggatccctt 


cagccctggg 


22560 


aggtcgaggc 


tgcagtgagc 


tgtgactgta 


ccactgcact 


ccagcctgga 


tgacagagtg 


22620 


agaccctgcc 


taaaaaaaaa 


aaaaaaaagg 


ctgggtgcgg 


tggctcatgc 


ctgtaattcc 


22680 
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agcgctttgg 


gaggccgaga 


tgggcggatc 


acgaggtcag 


gagatcgaga 


ccatcctggc 


22740 


taacacggtg 


aaaccccgtc 


tctactaaaa 


gtacaaaaaa 


aaaaattagc 


cgggcatggt 


22800 


ggcggacacc 


tgtagtcaca 


gctactcggg 


aggc tgaggc 


aggagaatgg 


cgtgaacccg 


22860 


ggaggcggag 


cttgcagtga 


gccaagatca 


caccactgca 


ctctcagcct 


gggagacagc 


22920 


aacactccgt 


ctcaaaaaaa 


aaagaataaa 


acccatggct 


gggatggacc 


ctgaacctgc 


22980 


agctgcagct 


gttcctgggt 


aggtctgtgg 


gcgacgtggc 


tttgcttctc 


catgttccca 


23040 


agagacaagc 


atcacccatc 


catgagaaac 


aagcacatcc 


tcagggcgcc 


cttacgtgat 


23100 


ctctggccaa 


tgaaccaaga 


caaagtgagc 


agacaccagg 


tctgggatgg 


caggtcccac 


23160 


ccccaccagt 


gcccagtgtg 


ccctgtttgg 


aggtgaccac 


agggtgtgtg 


cccagaggct 


23220 


gggcgtgact 


ctcagcggag 


accagagggg 


aaccacacca 


gcttggagga 


ctcagttccc 


23280 


atcccagcca 


gctgggatga 


gccacaggac 


acaagggctg 


gcagacctat 


tgtgttttgt 


23340 


ccacccttca 


cagcagagaa 


aggggacagt 


gcccagaatg 


tcctctgagg 


agcctcctcc 


23400 


cactcttggt 


ccttgtaaaa 


tggtgctgac 


tcccttgctc 


ccttcttcct 


ggggtgggcg 


23460 


gcaaacccca 


ttcccctcag 


ccttagcaag 


tgatttagaa 


acaggcagct 


cgcccaagcc 


23520 


aggcatgaga 


gtgatcccgg 


gacacaggga 


gaacaagccc 


cgctttgccc 


tctgggggtc 


23580 


tccattcagc 


agaagaggca 


aatgacagac 


acacagccgc 


ctcctccccc 


accatggtgc 


23640 


tctgcagcct 


caggagcctc 


aggtgcacca 


agggccaccc 


catccagggg 


gccatgcttc 


23700 


cttgagtggt 


atcgttcctg 


agcgagtacc 


atctccacct 


tccagagggg 


ctgtgacaag 


23760 


atcaacaaga 


atgagggcat 


aggagcctcg 


aaccaaacat 


gccctcttcc 


ctgcagaggc 


23820 


tgactgcgcc 


cagctgctat 


caccaagccc 


ctgctcctcc 


ggccccgtgg 


ggacagggta 


23880 


agaggggtgt 


cacatggaac 


agctctccaa 


acagtccctc 


tcaagctgct 


gtctcctgtg 


23940 


catctagtga 


gaacccaacc 


aacaaaggga 


aggtgggaat 


tgctattccc 


attaggcaga 


24000 


tgagaaaact 


gaggccccga 


aaggctggcc 


tgttccaggt 


tacaggcgct 


gagcggctgc 


24060 


tctgggaaca 


cacttggtgt 


ctgctgaggg 


cccgagcccg 


gccatcatat 


gactcaccct 


24120 


tcgccagcaa 


agcccgggtg 


tgggtgaact 


ttccctggca 


gcctgggact 


ccaaggtgct 


24180 


ggcagccagc 


ccagggaagg 


ctcccgcgtg 


cctgcggcag 


acgccttgct 


ttacctgcac 


24240 


gtccccaccc 


ctaggagcct 


ggacagagcc 


cagaccctcc 


gccacctcct 


gagaaggtat 


24300 



BNSDOCID: <WO 0177327A1 J_> 




WO 01/77327 PCT/US00/16951 

94 



caggggcatc 


agtctggact 


tgggggggaa 


tccacacagg 


ccttccccaa 


atgctccacc 


24360 


gtggcccatg 


gaaaaggctg 


gaaaacgtgc 


aggagcagga 


gcctccgcat 


ggagcataat 


24420 


tcacattcct 


tccccgagtt 


tcataacaga 


ggcctgctgg 


tttccttaaa 


tggggaattt 


24480 


gcgagccagt 


cggtgaccag 


agactggttg 


gcgtggacgt 

ZJ ZJ ZJ ZJ ZJ 


gctcttgcag 


agtctcaaac 


24540 


gctaccacaa 


gcccagccaa 


attccacgga 


ggaaaatcga 


cttccgaaga 


aaagagctgc 

~j zj zj 


24600 


agcatggcct 


tcgtgcagag 


ccagctgcgg 


ttgtggttgt 


gtgttatttt 


agggaagggc 


24660 


cattttgcat 


tttaaagagg 


gggttgggtt 


tcaccctggc 


tttaatttga 


gacccggggg 


24720 


ccactgcagc 


cccttgtcag 


gctggtacag 


gccggggact 


cctcccatgc 


taagccagtg 


24780 


tctttctggc 


cccagatcct 


caggggccag 


agggtcatcc 


ccagagcccg 


ctctgccacc 


24840 


cacatgggta 


ccctgggcct 


gggagggatg 

mJ mi ~J — * ^J -^J — ' 


tgccttccct 


caaccctgcc 


tggatgtccg 


24900 


cacggggcca 


cctgcattgc 


tgaaactgca 


acgaagtcga 


gtctcaggag 


gggcccccct 


24960 


ggctgcaggg 


ctcttgatcc 


ttttggccac 


gtgcacactg 


aggtggacgc 


tcggacccag 


25020 


agaccccctt 


catgatgatg 


gccggggcag 


gaaccccctc 


ctctgaggaa 


ggaccctggt 


25080 


gggggacagc 

~J ZJ ZJ ZJ ZJ ZJ 


actgcaggag 


ggcacaggag 


atgacggggg 

«»■• ~ ZJ ZJ ZJ ZJ ZJ ZJ 


ctctagcagg 


qccqqqaqqa 


25140 


aggccaagat 


gctcctcgca 


accgtgtgcc 


tqtqqccaqq 

w 3 ^zjzj ^ ZjZj 


acagaggaca 


aacccaccct 


25200 


ccactgtccc 


cactctcagg 


acagcagtcc 


tgccccagga 


ctcagcgccc 


acacttatgc 


25260 


ctgaggacca 


ctattcaagt 


cagtattfcgg 


cgagcagggg 


ttgctgccgc 


qqgcgctgtq 

Z? ZJ Z? w 1 ~~ZJ ZJ 


25320 


acaggctgga 


atcctctccc 


tctccctctc 


cctctccgga 


gacatggagc 


ctacagggac 


25380 


agagtcagca 


cctcagggta 


ggaccatggc 


tggcgtcatc 


agcatcactg 


gatctgatga 


25440 


gtgggagccg 


gcatctcact 


gttttcactc 


tctcattcaa 


atgactggag 


caaagggaag 


25500 


gtgtggggag 

■mJ mmf ZJ ~J ZJ ZJ ~J 


aggcccagga 


atcaacacta 


aggtcaactt 


tgcccccagg 


qqcaqqqqtq 

ZJ ZJ ZJ ZJ ZJ ZJ Zj 


25560 


ggagtgaaca 


gccacaggfcg 


tqatcctqqq 


qaqqqcttct 

ZD ZJ ZJ ZD www 


qqqaqaqaat 


tcagaggcaa 


25620 


gcatgtagag 


gaaccatttc 


aaatagttaa 


gaaaagccag 


agccaaacag 


qqacaqttqq 

ZD ZJ ZJ ZjZj 


25680 


ctcgcagaga 


tgatgcaggc 


aaagccagct 


cagatctgag 


catgggaaag 


actactccca 


25740 


accaagggcc 


cagcatctcc 


caaccaagca 


ccaagtacct 


cccaaccaaa 


tgccaagcac 


25800 


ctcccaatca 


aatacctccc 


aaccaagcac 


ctagcacctc 


tcaactggac 


accaactact 


25860 


cccaaccagg 


caccaagtac 


ctcccaacca 


agtgccaagc 


acctcccaac 


caagtaccaa 


25920 
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ttacctccca accaagcgcc tagcacctcc caactgagca tcatgcacct cccaacagag 
catctagcac ctcccaactg atcacctccc aacctagcac cgagcacctc ccaaccaagt 
gcagagcacc tcccaaccaa gtgccaagca cctcccaatc aaatacctcc caaccaagca 
cctagcacct ctcaactgga caccaacaac tcccaaccaa gcgccaagca cctcctaaca 
aagtaccaat caccttccaa ccgagcacct agcacctccc aactgagcat catgcacctc 
ccaacaaatc acctagcacc tcccgactga tcacctccca acctagcact gagcacttcc 
caaccaacat agcaaaagcc ataaagaagt aaaaagacaa aaccacgtag gcatggagac 
tggacttctg gtggcgagga aagggcattt ttattataac gacagctaac atttgttgaa 
ctcacaaact gttcttggtg ttttcctcat gacatgcagc atggtcacgc ctctgtacag 
acaaggatac tgaggcacag agtggcaccg tgccaacctt gtctcatctt tttatcgaac 
ctacatgcag agtgccagca aatccagctg tcttttctct tcagaacaga tcccaaatct 
cgccactcct tacccccaca agtgaggtgt ccccgctgct gctttctgtc gccaggatcc 
cggtaataac cgtggagagg gctcctgccc ccacgccacc caccccacag ctcactctcg 
ctccagccac caggggatgc cttccagcac gagtcagagc tggcacctcc tctgctcgag 
acctcatgtg tcctctcctc acaccttggg ccctgtttcc ctacattctg ctacagcccc 
tcaaacaggc cccgccccaa accagcccag ggcctttgca ctggctgatc cctctgcctg 
gaccgcgctg cccccagaca gccacacggt tctcagcctc atctgcttcc agtctcgact 
caaaagtcac caagaggcct tcccagcacc tgagctccga cggaagcccc tcgccacagc 
acccaagcac tgctttatcc ccctacgcac acgtcccttt caaatactat tcatttacca 
tctcctccca ctcactgaaa gggccagaga ctgggctata cccgctgcgt ggggagcagg 
accaggcgca agggctcaca aatgcagtgg atgcctggtt gggaggtgag ggagctgcag 
cgacccacgc tgggagggaa cgcaatgaca ggaggagcgc aggtcctggc gacacgatgg 
ccatggcagc cgctggtgag caaccgcagg ccggccctgg gagagggctt ctagcaagct 
gctatcttca gcctctccga ctactgcaga tgccccctcc tagccagaga cactgctaca 
ccagccgacc cttccaaaaa gaaggtcagt aaccccgcga ctcctggagc cacagtgcag 
ggggagaggg ctgagagggc aacagttcac caagcggaac agaggctgcc ccggaggtca 
gctggctccc cggcagctgc aggggtggct agcccactcg gagggcagcg agggcatacg 



25980 
26040 
26100 
26160 
26220 
26280 
26340 
26400 
26460 
26520 
26580 
26640 
26700 
26760 
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27000 
27060 
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27180 
27240 
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aggggctcca gggatgagtg gttgcccagc acagcacccc tgggaggccg ggggcacttc 
tcaggtagtg ggggcacgag gctgctctgg cctgacctca gggactcaaa atactttggc 
gataaattcc accgtgtccc acccctgctg gtaccccata cttacacaca gactggttca 
gatgcagaca ctctcgcgca catactcgct cacacgggca catacacgtg cacacacagt 
cacatgcgca cactcataca cacacaaata tccactcaca cgcatgcatg cacacacacg 
gacacacaca ggctcacacg tatgcacgca tatgcgtgca cacgcacaca cacacacaca 
cgctcacatc ctcccactcc cacactcagt tgctcagaca cacacacgcc tggctctcac 
acaaacctgt tgggctctga aaggctccag cccttcccat gctcgtcaga agccagtcaa 
tggcttccta agtcaccaca cagatcaaag aggtgaactt ggccacatgg cactctgctt 
cctgagctcc caaacaccag ccttggtgag gacagaccct caccccacac cctcattccc 
actaccctgg gcaggcccag aggaggggca tctgcaggat ctggcaacca gcccctcccg 
cccggctcct gcagccggca ccatgggagt cagggggagg tcactgcaaa gggcaacagc 
aagttggtgg ccccaggact agagcccagg ggtcttcagt cctactccag agcttggaca 
ctgtcccaca gggcatggcc aagggaaggg cttccagagc cctgacttca gggaggaggg 
caggcgggct cctgtggcag gcctggatgc atggccgccc actcctggga ctttctaacc 
tagaatatct aggtcaggct gggtgcagtg gctcacgcct gcaatcccaa cactttggga 
ggccgaggag ggtggatcac ttgaggttag gagtttgaga ccagcctggc caacatggcg 
aaaccctgtg tctactaaaa atacaaaacc tagccaggtg tggtagtgca cgcctgtaat 
cacagctact caggaggctg aggcaggaga atcacttgaa ctcgggaggt ggaggttgca 
gtgagctgag atcgtgccat tgcgcaaaga agatctaggc cggcccctca accggtgagg 
tccaggctgg gagtgctgag agactgtggt gacactgaat gaactaacag gcaaagggct 
tccaactgag cctgggggtg gtgggaaatg gctcttgtgt tctagtcaag acctctgcca 
accagttctg acactgaccc agcacagaac ctgacaggtc agcaagggcc agggcttagc 
acagcccagg taagggtgtg tgtacggccc ccagagtcac tcccaggctg caagaaaagg 
gacaaaggag ggacaagggg tggccaagca aactgttccc tctgctcggg agtctggggt 
gacctggcct agctggccag tggagctggg ccacctcccc ttaaactctc caccccggac 
ttcgactcca aagctttcct gccacccacg ctctccccac ctgggatcac ggccaggccc 
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tgagccttca 


agggcccagg 


tgaactcagc 


cagac cagga 


gctgaggagg 


acacagggca 


29220 


gcttccagaa 


cggacccgag 


aaccactccc 


agcaggttct 


gcttccagac 


aaggagctgc 


29280 


actttttcag 


ccaatgcaat 


tagaaagcca 


ggagaaggtg 


caaatt ccac 


ctgcctgagc 


29340 


gtccgcactt 


cccaggccgc 


ccaccataca 


cacagcaaag 


augcgcxt aa 


ccattcaaac 


29400 


ccatggccaa 


ccacatcggt 


tgcctcagac 


acgcaagc ct 


L.cid.aaaggaa 


cataactatg 


29460 


ggccaggcac 


ggtggttcac 


gtctgtaatc 


ccagcacttt 


999 a ggccga 


ggtgggtgga 


29520 


tcacctgagg 


tcaggagttc 


gagaccagcc 


tagacaccat 


ggtgaaaccc 


catctgtacc 


29580 


aaaactacaa 


aaattagctg 


ggcgtggtgg 


tgggcgcctg 


taatcccagc 


tacttgggaa 


29640 


gctgaggcag 


gagaatcact 


tgaacccggg 


aggcgaaggt 


tgcagtgagc 


cgagattgtg 


29700 


ccactgcact 


ccagcctggg 


caacaaggga 


gactccatct 


caattaaaaa 


aaaaaaaaaa 


29760 


aaaaaggaac 


ataactatgg 


agtctcaagg 


ggaagtaatt 


ccttcaacaa 


taacaaatct 


29820 


tgaaagctga 


gctctttttt 


ttttttgaga 


caggatctcc 


l_-___l_.l_.l_ —L. _ 

tcactttgcc 


gcccaggctg 


29880 


gagtgcagtg 


gtgggatcac 


agctcactgc 


agcctcgatc 


tcccaggctc 


aaatgatcct 


29940 


cctacctcag 


cctcccaaga 


agctgggatt 


acaggtgcat 


accatcacac 


ccgattcatt 


30000 


tttgtatact 


ttgaagagat 


ggggtctcac 


catgttgccc 


agtgtggtct 


tgaattcctg 


30060 


gactcaggtg 


atctgcccgc 


cttggcctcc 


cagagtgctg 


ggattacagg 


cctgagccaa 


30120 


cacccccacg 


ggttcatttt 


cagagtcgca 


ccgagtgctg 


gggttacagg 


cctgagccaa 


30180 


cccccccacg 


ggttcatttt 


aagagtgaca 


ccgagtgctg 


gggttacagg 


cctgaaccaa 


30240 


cccccccacg 


agttcatttt 


cagagtcgca 


ccgagtgctg 


gggttacagg 


cctgagccaa 


30300 


cccccccacg 


ggttcatttt 


aagagtgaca 


ccgagtgctg 


gggtxacagg 


cctgagccaa 


30360 


cacccccacg 


ggttcatttt 


cagagtcaca 


ccgagtgctg 


gggttacagg 


cctgagccaa 


30420 


cccccccacg 


ggttcatttt 


cagagtcaca 


ccctttttct 


gaaaaacaac 


ttgggctcat 


30480 


gcaaattcga 


gagagagatg 


gtgacactcc 


ccgccccctg 


gacccaggtg 


gagtcgcagc 


30540 


agggtttacc 


cgtgagcggg 


gtccaaggcg 


atggccctcg 


gctggtcaag 


gtcctgccag 


30600 


aagagcacct 


tccgggatgt 


gccattgagg 


ttggccacct 


cgatgcggt t 


ggtctctgag 


30660 


tccgtccagt 


acagcttctt 


gcccacccag 


tcgcaggcga 


ggccgtcggg 


agagaccagg 


30720 


ccggagatga 


ccacgttctg 


cacggcggcc 


cccgtctggt 


tcaggtaggt 


ctgcttgatg 


30780 



BNSDOCID: <WO 0177327A1J_> 



WO 01/77327 PCT/US00/16951 

98 

gcctcctcgc tcacgtctgt ccagtacacg gctcccttgg aaaactggaa gtccactgcg 30840 

gccgcatcct ccaggccgct gaccacgatg gtggactcca gcttgactcc gccggcgtcc 30900 

accagccgta cgtcccggcg gttggcaaat agcaggagcg gcgaggctgt ggggcagaag 30560 

caaaccgtga gggccactgg ctaagccagc aagatacaca gccctgggat ggagcactat 3102 0 

gcccagagca ctcctggtac tgccctgccc atgcccaaga cctccagttc cttcctccca 31080 

cccctaaggc gttgtcagga agttgcctgg gcagccccgg cccgcatcat tcagaggctc 3114 0 

ctgcagcgca gcaaacagcc ttcttcccac attcggtgac agcacctgtt tgtttaccaa 31200 

ctgttacgtc tgttccccca gatatgggtg acccttcctg ccatgcccaa aacctcccac 31260 

atcgtcctcc agaggctaca ggggccctgt cctgttctgc agagaagcca catccccttt 31320 

gttggcctga cacaggggat ggggacatgc aggcacagca ctggccatgc tgctcgctac 313 80 

agacccagcc acagggccac attttttgag gggttcagag cccaggccag acagagcctc 3144 0 

aagattccct tacaagtctt tgaccactgt ccaagctcag gcccgtttcc ttggccgtgg 31500 

catcagcttc ccatccaccc ctgtattcca tgtttctccc accctgcttc tggacattcc 31560 

tacatttaaa gggtcactct ggaatgccac cccttggctc agacaccttc cacagctccc 3162 0 

tgtgccagtg ccatgcagaa caaggtcaga ccccctagcc tggcctccaa ggccttggcc 31680 

tctggcctca cctacacttc tctccaccac cccaccccaa gcattcctga tctgcctgcg 31740 

gccaggctgg ctccctcacc tccctgtgca ccgcagccct cagccccttc tgcctgtgca 31800 

agaagcctca tctcacagac aacggtctca ttcccacaac gggctcaatg agaaatcagg 31860 

agaggccttc agaccatcac cccaccagac acctcagacg tcggaccagg agggtccagc 31920 

aacccccaac acagactcag agggactaag aagccacatg aggagtgaac acaagatgtg 31980 

gacaggagga ggttaagggc ctccagggag ctccatcagt ccgtgttctg ctgtcagcag 32040 

ggttaggctg ggctggccac aaacaccccc aaaaaacatc tgaagccttg gcttgaaaca 32100 

gctgacattc ctcatgaaaa ctgcagaccc ctgggtcctc ctgcgcagat gggggagccc 32160 

agccaacccc acactcccac cttcaccaag aaagagaaag ccaaaacaaa ctcaactcag 32220 

ccaatgacaa tcacagaact gaatcctgta gttagttcag ttggtttcat ttcagcaggg 3228 0 

gaaagatttg cagcctctat gagggtagct gggaacacaa agggccagag catggcccag 32340 

gagaccccag cgcagtgggg tagatggttc cgagcacagg cctccctgcc aagacaagca 3240 0 
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ctggctcaaa 


tcctggcccc 


tcccattccc 


aggagacatg 


ctccacagga 


tgggaggaca 


32460 


cacagaggac 


ctgaggccag 


gaaaatgaca 


gcggcgcctc 


cgccgcccca 


cccgtgctgt 


32520 


catcatctta 


ggtctacagt 


tctttgtggc 


aacgagggac 


actgtgaaag 


tcaaacaaca 


32580 


ggaaggcata 


ggccacaaat 


aaagacaaac 


gggacttcat 


gggaagctaa 


agattttgtg 


32640 


catcaaaaga 


cactatcgag 


agagtaaaaa 


ggcaacccac 


agaatgagag 


aaaatatttc 


32700 


caaatcatag 


atctactaag 


agattaatat 


ccatgaaata 


cagagaactc 


ctaaaactca 


32760 


acaatgagaa 


aacaactaag 


ccaactcaaa 


aatgggcaaa 


caacttgaac 


agacatttct 


32820 


ccaaagatga 


catataaatg 


gccaataaac 


acatcaaaac 


aggcttaata 


tatccctaat 


32880 


catcagggaa 


atgcaaatca 


aaactacaat 


aagataccat 


cttgcaccaa 


ttaggacggc 


32940 


tactatcaaa 


aaaacaaaat 


agcaagtgtt 


ggtgaggatc 


tggagcaact 


ggaacccttg 


33000 


tgcaccactg 


gcaaaaatgt 


gaaatggtgc 


agctactatg 


gaaaacagca 


tggcagttcc 


33060 


ccaaaaactt 


aaacacagaa 


ttaccatatg 


acccagcaat 


ttcgctttgg 


gttatatacc 


33120 


caaaagaact 


gaaaacaggg 


acacaatcag 


atatgcatac 


accttggatc 


acagcagcat 


33180 


ccttcccaac 


agctaaaaca 


tggaggcagc 


caggcatggt 


ggctcacgcc 


tgtaatccca 


33240 


gcactttggg 


aggctgaggc 


gggtggatca 


cctgaggtca 


ggagttcgag 


accagcctgg 


33300 


ccaacatggt 


gaaaccccgt 


ctctactaaa 


atacaaaaat 


tagctgggcg 


tagtgacggg 


33360 


cacctgtaat 


cccagctact 


cacaagtctg 


aggcaggaga 


atcacttgaa 


ccctggaagt 


33420 


ggacgttgca 


gtgagccaag 


attgcgccac 


tgcattccag 


cctgggtgac 


acagcgagac 


33480 


tctgtctcaa 


aaaacagcaa 


aacaaaaaca 


aaaaaacaaa 


caaacatgga 


agcaacccaa 


33540 


gcgtccctct 


actgagggat 


gaatagcggg 


gcaaaatctg 


ctccatccac 


acaatggagt 


33600 


actattcagt 


ctcaaaaagg 


aaaaagattc 


tggtcaggca 


cggtggctca 


tgcctgtaat 


33660 


cccagcactt 


ggggaggctg 


aggcgggtgg 


atcacctgaa 


gtcaggaatt 


caaggcccgc 


33720 


ctggccaaga 


ctggcaccna 


gctacacana 


aagtatangg 


ccccggaaa 




33769 



<210> 9 
<211> 72049 
<212> DNA 
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<213> Homo sapiens 
<220> 

<221> unsure 

<222> (8356) , (8385) , (38585) 

<223> Identity of nucleotide sequences at the above locations are unknown. 



<400> 9 



tataccttgc 


gcggaccttc 


ggctcctgtg 


gtgaagacaa 


tatgaagaaa 


atagaaatta 


60 


cccataattt 


tgccacacag 


acttagttgt 


gtccatgtat 


cttgtgcacc 


ttttttctgt 


120 


ttacggatca 


aaatcgactt 


ttaqqqtcaq 

ZJ ZJ ZJ 


qcqcqqtqqc 


tcacacctgt 


aatcccaaca 


180 


ctttgggagg 


ctggagttgg 

zj zj zj zj 


qqttqqqqqq 


tggatcactg 


aaga t c agga 


qtttcraqacc 


240 


agcctggcca 


acatggcgaa 


actccatct c 


tactaaaaat 


aaaagattag 


ccaaacataa 


300 


tggtgggtgc 

ZJ ZJ ZJ ZJ ZJ 


ctctaatccc 


agctactccg 


qaqactcraqci 


caggagaatc 


gcttgaaccc 


360 


aggagacaga 


aat t crcaot a 

ZD ZJ w -3 ZJ 


aaccaqqatc 


acgccactgc 


actccagcct 


crcrcaacacracr 


420 


cgagactctg 


tctcaaaaaa 


aaaaataaaa 


ataaaataaa 


taaatacata 


aattgacttt 


480 


taggagattg 


gttcaaacaa 


tgtgtgtaat 


gttgtgtctg 


agtgtttttc 


atttatcgtt 


540 


catgcaaatt 


ccgacatcat 


tcactcttct 


ccagagtgtg 


ctgttttcct 


gcctgtgtca 


600 


tcacccgtca 


ccttgaatgc 


cctcgtttag 


gtaaaataag 


tacattttat 


tcaaaaatat 


660 


ttgaggacat 


ttgggttgtc 


tccaggttct 


tggtcttgag 


ttttgctgtt 


cttgtggagc 


720 


catggtggtg 


tctggttgca 


ggaacctcca 


tgcgttccag 


ctgctgcttc 


tgcctgtgtt 


780 


cttagagagg 


aaatgctggg 


gtccgcggtt 


cccgggctgc 


tgaccaggaa 


gcctgcggtg 


840 


ctttacggcc 


cttccagaag 


cgggagatgc 


ccccacttaa 


gtgtcagaca 


ggcctttcca 


900 


cctcactggc 


agctctgagc 


ggctcccttc 


tatttgcaga 


tgactgagaa 


gttaccaatt 


960 


tccacgttta 


ctgactgctg 


tttctcctgt 


taatttgtat 


ttatagtctt 


cgctaattta 


1020 


ttgctagggt 


tttggtgttg 


tccctattga 


cttgtatgcc 


ttttaatttt 


ttaaacaaca 


1080 


ttaatatact 


tcattttttt 


agagcagttt 


taagtttaca 


ggaaaattaa 


gggacaagta 


1140 
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cagagagttc 


cttccacctg 


ctgtcctcct 


ctcctcctcc 


ccaccttccc 


tccttcccct 


1200 


attgtaactt 


tctttctgat 


attataaaag 


.tcactcatgg 


ctgggcgtgg 


tggctcacgc 


1260 


ctgtaatccc 


agcacgttgg 


gaggcagagg 


caggcagatc 


acctgaggtc 


aggagttcca 


1320 


gaccagcctg 


gccaacatgg 


tgaaaccccg 


tctctactaa 


aaacacaaaa 


agttagccag 


1380 


gcgtggtggc 


gggcacctgt 


aatcccagct 


actcaggagg 


ctgaggcagg 


agaatggcgt 


1440 


gaacctggga 


ggcagaggtt 


acagtgagtc 


gagatcgcgc 


cactgcactc 


cagcctgggc 


1500 


aataagagtg 


aagcttcgtc 


tcaaaaacaa 


agtcacacac 


gcttcttgta 


cgagggtcat 


1560 


ttggccgagg 


ggccagatgg 


ctcaccatct 


agttgggaca 


ggccatgagc 


tcggaatgct 


1620 


ttttacatat 


ttacatggtt 


gagaagaaaa 


tcaggagaat 


aatgttttgg 


gacatgggaa 


1680 


aatgacatgg 


aatttgcatt 


ttagtgtcca 


taaatgaagt 


tttgtttgct 


cccagctgtg 


1740 


ttgactgagg 


caggctggct 


tcctacagct 


gcggcagagc 


tgaggaggcg 


ggaaggagac 


1800 


cgtgcaggcc 


gcagcaccga 


aaatatttgc 


tctctggccc 


ttcccagagt 


gcttgccgac 


1860 


ctctgtccga 


cagctagaaq 


gaaggatagg 


acccgtccga 


cgataaccac 


tgttgacatt 


1920 


tgagcgcgtt 


tccttcccgg 


cttttgtgtg 


agagtggcag 


tctgtttgct 


tttgtggtcg 


1980 


ggatctgctg 


cacgcacggc 


gggctgtttg 


catgaggctt 


cctggaggat 


agggctgggc 


2040 


tcggagctgc 


acgcagtggg 


gcgtgtcctg 


catgcagtgg 


ggcctcagaa 


gagagctgtg 


2100 


gtgggcgggg 


cagtgccaac 


gctggtgggt 


gccaggcctc 


cacgctcaga 


tcagccccgg 


2160 


cgacaggttt 


gggccaccct 


ctctctggcc 


tctgtgcagt 


ggcccaggcc 


gtctgctctg 


2220 


cctggcacac 


ttgcctctgt 


ccttccactg 


aagcgctcct 


cttaccctct 


gctcccggct 


2280 


gggtacgttg 


aattgtgtcc 


ctcaaggaga 


tatgctaaag 


gtctaacccc 


aggaacctgt 


2340 


gtatgtgatc 


taatttggaa 


acagggtctt 


ggctgatgta 


atcaagcgag 


gatgaggtca 


2400 


ccctagagta 


gggggcctat 


atccacggtg 


ctggtgtcct 


catgagagca 


ggtgagcaga 


2460 


cactgacact 


caggggtgaa 


ggctgcatgg 


agtcagaaca 


gggcttagtg 


cgatggcggc 


2520 


cacaagccaa 


ggaactccaa 


gtatttcctg 


caacaccaga 


agctggaaga 


tgccaggaag 


2580 


gatcctgccc 


tggagccttc 


ggagggagtc 


tgtccctgca 


gacgtcttga 


cttttgattg 


2640 


cagggatgca 


tgtcttaggg 


tgtgtggggg 


ggtgcatttc 


tgatgttaga 


agccacctgg 


2700 


ttggtggcga 


tgtgtcacgg 


gagccctctg 


caggttctgc 


gtgtccatgt 


ggtcggggac 


2760 
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agaggtgggc 


agggacggac 


ggtgtcgagc 


tggacatgtc 


catgacgtcg 


gccatccctt 


2820 


gggatggctt 


ttttgttttg 


aggataaggc 


tgcctgccag 


gaagctgtgc 


cctgcctggc 


2880 


ccttgcccca 


agcccctggc 


ctgtgcttgg 

3 ~J — > ~i 


cctcgcggaa 


gggatgtcgc 

33 3 3 v 3 


ccttctctcc 


2940 


tgcatgcgtg 

3 ZJ ^ ZJ ""3 


cagggaggaa 

ZD ZD ZD ZD ZD 


QQQQagaggt 

ZD ZD ZD ZD ZZ3 ZD ZD w 


cagcagcccg 


cctqcraagaq 

33 33 3 


gctcgggcga 

ZJ 333 3 


3000 


ggggaaggtt 

ZZJ ~J ' > ^ 3 


tcactttcag 


gcaatgttgt 


qggqctgttt 


aaacaacccc 


aaagaaaacc 


3060 


atttggccaa 


actgttagtt 


tccaaacatt 


ttacttcctt 


ggtgtttaaa 


taaattccta 


3120 


ccaagactct 


gtagctggtc 


ccagggaagg 


agttggcctc 


tcttctttat 


agcccggcac 


3180 


agtcagtccc 


ctgcacctgc 


ccctcccaac 


cccaggcctg 


cttccccgtg 


gccatggctg 


3240 


ctgcccggac 


ctctctacac 


acagaacctc 


ctqgaggcca 

ZJ ZJ ZJ ZJ 


gctgtgggca 

3 3 *"333^'*^ 


ccagccttgg 


3300 


cagggctgtg 


gcggagccca 

ZJ ^3 ^< ZJ 


ggctgctggt 

3 3 3 ZJ ZJ 


actctctctg 


cagctgctcc 


ctgctggcct 


3360 


ggctggacag 


cgtccccacc 


accactgggg 

ZJ ZJ ZJ ZJ 


tcacctctgt 


gctggtcaca 


gctcactcag 


3420 


accttcaggc 


aaatgggttg 


gatcctgcct 


ctctcccagg 


tgtctcagtc 


tctgcaaaac 


3480 


tcaaaaacct 


cagaggcctt 


gcagcctgag 


gggtgtcaga 


gacacctcct 


tcgaatcagt 


3540 


aaacacctac 


agattcaccc 


cagcagtgaa 


aggactgctt 


cgccacagag 


gtttgattta 


3600 


ctcctaagta 


a 1 1 agaacrgg 


atgccgagaa 


taggttcctc 


atcrQtQQQac 


tagaggccct 


3660 


cfcgctgacct 


agttaacaga 


ctqqc t aggg c 


tgggtgtgct: 

ZD ZJ ZJ ZD ZJ 


cagcccctga 


aggttctagq 

33 3-3 


3720 


cccatttggg 


acaccccgcc 


agaacctgcc 


acaacctgcc 


atgtggtgac 


agctacctaa 


3780 


atcccagagg 


ctcttgagct 


ggagagcaga 


cctctcaatc 


tcagcaggcc 


ccccacacag 


3840 


accccataac 


cctagtctgc 


cttcacagta 


cagttcgtgg 

ZD 3 3 3 


ctatgtgttc 


acggatggtg 

33»*» w '33 "*3 


3900 


ttgttcacct 


aaggtctctg 


ccctgtgacc 


ccaagggcgt 

3 33 3*"^ 


cctgagggca 


gattccaagt 


3960 


ctgtttcgtc 


cacccctcct 


tccctagcag 


cgcrgtccagg 

w 3 33 w ^ZDZD 


qccfcqqcctq 


aactagcttc 


4020 


ccacagagat 


actggtggga 

ZJ ZD 333 


tgatgaaggc 

3 —3 ZZ) ZJ 


agccaggcgg 

3 33 33 


caagtgaaaa 


acgcacttcc 


4080 


tgcatgtgct 




attgaagtgt 


ttgaggaagc 


aaagtgaagt 


gagctttcct 


4140 


cttgcggctg 


tgtgtccttg 


ggccgggagc 


ctaccctctc 


tgagcgttgg 


ggtccttgtc 


4200 


agtagaatgg 


ggcatcctca 


tagctcaagg 


ggtggtgtgt 


gaaaattgtg 


ctattgtgtt 


4260 


actttaatga 


tttttttttt 


ttcgagacaa 


agtctcaccc 


caacgcgcag 


gctggagtgc 


4320 


agfcggcgcga 


tctcagctca 


ttgcaacctc 


tgcctcctgg 


gttcaagtga 


ttctcctgcc 


4380 
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tcagcctccc 


aagtagctgg 


aattacagga 


gtgcgccacc 


aggcccggca 


tatttttcta 


4440 


tttttagtag 


agagggggtt 


ttaccatgtt 


ggctaggctg 


gtcttgaact 


cctgacctca 


4500 


ggtgatccac 


ctgcctcggc 


ctcccaaagt 


gctgggatta 


caagcatgag 


ccaccgcgcc 


4560 


cggcctactt 


tagtgatttc 


ttaggaggac 


agagggaacg 


ggctggcaag 


acaggcttgg 


4620 


aatgtgtttt 


gggatcaagt 


gccggtttct 


gtctggcact 


ggcgttctct 


gtggggccat 
■ — > — * ■ — > — * — i 


4680 


gatggacaca 


ctgctgaggt 


caagcgtgat 


tcgtcttgcg 


ctgtgcctgg 


cagtctcatt 


4740 


ggaaagttct 


gtagacatcg 


tgtggatggg 


gctcttcccg 


gccaagccct 


tggggacctt 


4800 


ccaggactgt 


gatctcccca 


cagtggctgt 


taagcaggga 


cctttcgtga 


agtggagtct 


4860 


ctggtcccct 


ccaagtcata 


gctagacagg 


gactcgggca 


tcgccaagcc 


tggctgatta 


4920 


ttcactggat 


gaggagacag 


gcccagagag 


gggcaggaac 


ctgcccgagg 


tcacccagca 


4980 


ggccccagag 


gtttcggtct 


cggattctcc 


ctgctcatcc 


ctggatgtag 


tgctgctgtg 


5040 


qatqtqqttc 


tqtqctqqqq 


qctqtqqaqa 


qcaqqqqqct 


tgtgccagga 


ccccagtgag 


5100 


qgtggcgccc 


tcgccatgag 


gccgactgtt 


ggtatggggc 


ggccatccac 


tggggtgtgg 


5160 


ggagqaacag 


ctttcctgag 


gaggaggtgg 


cgggaggaac 


agcttccctg 


aggaggaggt 


5220 


ggcggtgctg 


tgtgacctgg 


gccttgaagg 


acaggtccat 


tgtcaacaga 


acattttggg 


5280 


agtggagcct 


agagggagaa 


aatttgttga 


aattcagatt 


cccctccccc 


taccaataca 


5340 


caccaaatca 


gatgcccctg 


accagatcta 


aatttggctc 


tcagagattt 


ccattgtagc 


5400 


tgqqcacttq 


gggaaccttc 


taagtgctgc 


ctctgcctct 


ccccagcctg 


cctgcctcag 


5460 


tttccccagc 


cctgggcccg 


tgtcgctgtt 


gccatcacgt 


gggcgccctc 


tagtggagga 


5520 


atcagattat 


gcactccggg 


gcttggagca 


ggagtcagga 


ggggctcctg 


tctttccttg 


5580 


aaacgttgga 


tgccgggatc 


ctggaacagt 


ctctgcattc 


ctcctggcga 


gaaccagagc 


5640 


ctgggcacag 


gggaccatct 


gttgtttgaa 


ggctgcagcc 


tggcagggca 


ctcaggagat 


5700 


ctcfQcacrttQ 
3 ^3 


qctqcaqqqc 

-3 ^3 -3 3 3 


C aqq t C t aqq 


qqc C aqqqc a 


tcaqqqaqqc 


tctqqqctqq 


5760 


ttcagccccg 


ggcccctttg 


cagattgtga 


cctgggcccc 


tgtgcagggg 


catggccaca 


5820 


ggatgctggg 


aggggtctct 


gaccctgacc 


ttcttggctc 


tgtgcatcct 


tgagaccaga 


5880 


aaggtctgga 


acaaatgagt 


agacgatgcc 


ctaacctggg 


gagggagcca 


catcctgatc 


5940 


ccagcaacct 


cgggaaggat 


ctgtcaggat 


tatggggcac 


cctgggggcc 


ccaagtctgc 


6000 
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atgggtctcc 


acttgcaatt 


tctgtaggaa gctctgataa 


atccaaactg 


ggggtcctag 


6060 


gacacagtca gaaatgctga 


taccgttgtg 


tgtggagcct 


cgggccc tgg 


gggtcaggag 


6120 


catgtggagg gtgggccacg ggggttcaga agagaatcct 


gtaacccccc 


accccccaaa 


6180 


ctgaagccca 


cttgagggcc 


atggctgaaa 


ggttgggggg 


tctccgtgcg 


tcctgtggag 


6240 


tgggtggtga 


ggagtccttg 


ggtttgcacg 
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ggggagagaa 
tcaatgcctc 
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tctgcctcct tgggttcatg ccattctcct gcctcagcct cccacatagc tgggattaca 
agtgcccgcc accacgcctg gctaaatttt gtatttttag tagacagggg gtttcaccat 
gttggtcagg ctggtctgga actcctggtc tcacatgatc cacctgcctc ggcctcccaa 
agtgttggga ttacaggcgt gagccaccgc gcccagccct ctgttgagca tattttgagg 
ttctcttggt gccagtgata tgtacatgtg tccccatcgc accatcgtca cccattgagg 
tgacattggt gcctctcctc ggggtggatg cctccctctg tttccagcaa cttctgaagg 
attttcctga gctgcatcag tccttgttga cgtcaccatc ggggtcacct ttgctctcct 
cagggctccc aggggaggcc cgaatcaggc agcttgcagg gcagggcagg atggagaaca 
cgagtgtgtg tctgtgttgc aggatttcag accctgcttc tgagcgggag gagtttcagc 
accttcaggg tggggaaccc agggatgggg gaggctgagt ggacgccctt cccacgaaaa 
ccctaggagc tgcaggtgtg gccatttcct gctggagctc cttgtaaatg ttttgttttt 
ggcaaggccc atgtttgcgg gccgctgagg atgatttgcc ttcacgcatc cccgctaccc 
gtgggagcag gtcagggact cgcgtgtctg tggcacacca ggcctgtgac aggcgttgtt 
ccatgtactg tctcagcagt ggttttcttg agacagggtc tcgctcgctc acccaggcga 
gagtgcagtg gcgcaatcac ggctcgctgt agcctcaatc tccctgggct caggtgatcc 
tcctgcctca ccctctgagt agctgggact acagacacat accaccacac ccagctagtt 
tttgtgtatt ttttgtgggg ggagatgggg tttcgctgtg gtgcccaagc tgatctcaaa 
ctcctgaggc acaagcgatc cacctgcctc ggcctcccaa agtgctggga tgacaggcat 
cagccgtcac acgcagctca atgattttat tgtggtaaaa taaacatagc acaaaattga 
tgattttaac cattttaaag tgaacagttc aggctgggcg tggtggctta tgcttgtaat 
cccagtactt tgagaggctg aggtgggcag atcacctgag gtcaggagtt tgagaccagc 
ctggccaaca tgatgaaatc cagtctctac taaaaataca aaaattagcc gggcatggtg 
gcaggtgcct gtaatcccag ctactcggga ggctgaggca ggagaatcgc ttgagcccgg 
gaggtggagg ttgcagtgat ctgagatcat gccactgcac tccaatctgt gtgacagagc 
aagactctgt cttgaaaaat aaataaataa aaaaaatttt aaaaagtgaa caattcaggg 
catttagtat gaggacaatg tggtgcaggt atctctgcta ctatctactt ctagaacact 
ttcttctgcc ctgaaggaaa ccccatgccc accggcactc acgcccattc tcccctctct 
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tagtgacctc tggccaggtg tggcagctca tacccgtaat cccagcactt tggaaggaag 
gccgagatgg gcagattgct ttagcacagg agtttgagac cagcctggcc aacatggtgg 
aaccccatct ctacaaaaat agaataaaat ttaagaggta atagtgacct tttggtagat 
cgaaacctgg attgctttct ttttctaaat gctgattctt ttctttgtgg tgtttgtgtt 
ctgtgccgat gtccctcccc cagccctgtt attgtgagtg gaagaagggg aaagggttcg 
cccgctactg tgagcccctc ctctcacgct gggtgtcctt ggagaagcct gcacttcttc 
attgtacgcc agggctgggt ccctccctgg agtggttctg tgctgctggg atggggccaa 
cccctcagat gttttctgag tgtcacacac aggtgtgtgc attcatggcc tttgcgtgtc 
ttcctgttgt ggaggcaaaa atgtgaagaa ccctagatga ttttgggacc agggctccat 
cacctgctgt tcattgcaca ccggagcatc caggcatggg tggagagctc agacttccag 
gcacggtcgc aggggctggt ctaaccatgt tcccgcccgc ctgctcgtca gaaccgcctg 
ttgggagctg ttatcatgat accatacctg ggccctgggc tatccgattc tgacttaatt 
gctccaggtt ggggccaggc cgttgtttgc tgttttgttg tttcttctgt gacgttagcc 
actgggctaa tctgagcccc tcagttacag gtggagaaac tgagacccat gggggtgcaa 
ggacttgccg aggacccaga gccccttggg ggcagagctg aggcggggcc tggctttggg 
tcccagagct tccagtcccc ttcccgctct cctaacagct tttttttttg agacaagatc 
tcaccctgtc acccaggctg gagtgcaatg gcatgatctc ggctcactgc aatcttcgct 
agctgcgttc cagcgattct cctgcctcag cctcccgagc agctgggatt acaggtgtgt 
gccgccatgc ccagctcgtt tttttttgta cttttagtag agatagggtt tcaccatgtt 
ggccaggctg atctcgaact cctgacctca aatgatccgc ctgcctcggc ctcccaaagt 
gctaggatta caggctggga tcacactgtg cctggcccta gcagctttgt cctgtgccat 
ccaacaacag atgaccgaag tctttgtttc ttaacatgca ttccatctgc cttacagttt 
tgccacctgc aaaacagagg acttgtcgct tttctggtaa gctggaaatg taatctggta 
gcaggaggcc tgtggaagct tgcctttaat ggccttgtgt ctctttcatc ctgtcctgag 
agccggagaa cttggatgtt gcacctaact caaccttcct gttaacatac agttctgcag 
gctcatggat catcagaacc acgtcctatc tcacgcggct gtatgcttcc gttggttcag 
gtgtttttac cttgacagta ttttctcctc ggtggctttt gcggtggttg cttttaatca 
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tgacgggtcc 


atgcctgggc 


ataagtgttg 


agctcaggtg 


ccccgacctg 


gggaagggca 


25260 


ggacaggaaa 


ggtgacagta 


tctggccaag 


gacagatggg 


aagggaccaa 


gggagctgat 


25320 


tagggagtgg 


ttatggacta 


ggaatgtcgg 


taacaatggt 


tagaaagtga 


ctaacatttg 


25380 


ttgagcacct 


gctgtgtgcc 


cggccctggc 


egggagcett 


cgtgcccaca 


gtgaccccgt 


25440 
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ctgcaaatgt agttccttgc cctactcgca ctggggagca ggacgcagag ccgtgcaact 
cacaggtgcc aagctcagga ctccctcctg ggtctgcctg ggctgggctg tgcttgttgc 
ccctgtggcc cacgcatgtg caccttccac ctgaaagcca ggatcttcag gacgctcccc 
gaggaggtcg ttgtctggca caatgatttg tctcttcctg aaaaggtgac agagttacac 
tggagagagc agcatccagg tgcggcaggg acaggcctgg ggctcgcggg cagggactct 
gtgtcctgcc ggggtcccac actgcacctg cttgtcagag gcactcagtc aatctttgct 
gatgaaggat gagaggacag aggacgtgat gcttgctgct gcattgcctg cagtcctggg 
tgagatgccc gggttgactc tgctgcccgt cgggtggatg tgatgtcaga tccccggctt 
taaaatacga gggagctggg aattgaggga gcaggttggg gcagaaagca cagccccgtg 
gaagcctgga gctgaggcag tgtgggcgac ccc tggagca gtgagtgctt ccttcatggc 
cttcatcgca ccctgcagtc ctcatgtagg ggatgccatc catgaattta gttttcccag 
cctcctttaa aaacgcgttc atgctggggc cggggcagtg cagtggctca catctgaaat 
cccaccactt tgggaggccg aggcgggtgg atcatgaggt caggagatcg agaccatcct 
ggctaacaag gtgaaacccc gtctctacta aaaatacaaa aaattagccg ggtgcggtgg 
cgggcgcctg tagtcccagc tactcgggag gctgaggcag gagaatggcg tgaacccggg 
aagcggagct tgcagtgagc cgagattgcg ccactgcagt ccgcagtccg gcctgggcga 
cagagcgaga ctccgtctca aaaaaaaaaa aaaaagtaca aaaaaaaaaa aattagtctg 
ggtgtggtat cacgcgccta taatctcact actcgagagg ctgaggcgga gaattgcttg 
aacccaggag gtagaggttg tagtgagccc gtatcgtacc actgccctcc acctgggcaa 
tagagcgaga ctctgtctca aaaagaaaaa aaaaaaaaga acafcttatgc caggtgtggt 
ggctcatgcc tgaaatccca gaactttgga agactgaggc aggaggatca cttgagccca 
gaaatttgag agtgtcttcc ctgggcaaca tagagagacc tcatctctac cagaaaaaaa 
aaaattagcc cggcatggtg gcatatccct gtggtcccag ctacttaggg ggctgacgtg 
gcaggatcac ctgagtctgg aggcagaggt tgaagtgagc tgagatcatg ccactgcact 
ccagcctggg tgacagacag agaccctgtc tcaaaaaaaa aaaaaaaaaa aagcatttac 
tatccaccat ggaaggtgag actgacctgt gagtgattgt tcaaagaaca aaaaataaac 
cccagagata agacaaaagg gtgcctccat gggggtgtga tttaaagctg agaaattggg 



25500 

25560 

25620 

25680 

25740 

25800 

25860 

25920 

25980 

26040 

26100 

26160 

26220 

26280 

26340 

26400 

26460 

26520 

26580 

26640 

26700 

26760 

26820 

26880 

26940 

27000 

27060 
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cttcttcccc 


ctcccctctc 


accccgtggt 


ttgctaaagg 


agatgggaaa 


aaggattctt 


27120 


tttttggctg 


aaatatttaa 


cactaaatta 


aagccaattt 


taacagcact 


ttggttgatg 


27180 


agtgaaatta 


acagactggc 


caaaaataaa 


cgaacggtct 


gtactatgtg 


aaaaagaggc 


27240 


agctttggcc 


atgctgggcc 


aatgtgagtt 


ttcagggttg 


ctgggaatgt 


ctgtgaatcg 


27300 


gaggaagggc 


ctagctggga 


ctctcaggag 


ccaaggccct 


gaggggcaac 


ttgcctggtc 


27360 


cctgccctga 


ggcgttcact 


gctttcttcc 


tgggccagat 


cacaggcccg 


gaggctggac 


27420 


cactcraqctcr 


gcactcttgc 


cgagctgctc 


cctgacttcc 


tgaccatgct 


cctttcagca 


27480 


accfctactcrc 


actttagttt 


ccttgaatga 


aaaatgggga 


tgagaatagc 


tcctacctcc 


27540 


aaggtgaatg 


gagtgagttc 


ggacaggtga 


ctccctggga 


ccagtgcctg 


gcgcctgaca 


27600 


aggtccagtc 


agagcccgca 


ctgctgttac 


tgataccctt 


ggctgtacca 


ggggagaact 


27660 


tggttgccat 


tgccaggtgt 


tctcccacca 


cccccactac 


tgtccctgtt 


tgatgtgtgg 


27720 


cgggaataaa 


gctgtgcaca 


ttggagcttt 


tggcacatcc 


tggctttcag 


gtgaaaggtg 


27780 


catatcrtatt 


tcfaacrcrttta 


gcctggccaa 


cccagccatg 


aggtcggacc 


tgacctgggg 


27840 


qtciaqtcctcr 


agctcggcac 


ccctgagctg 


tgtggctcac 


ggcagcattc 


attgtgtggc 


27900 


ttgggccgca 


cccctttccc 


tgctgggctg 


ttgatgttta 


gactggagcc 


tctgtgttcg 


27960 


cttccaggaa 


ccaacccgtg 


tgcggacagg 


aacggggggt 


gcagccacct 


gtgctfcctgc 


28020 


acaccccacg 


caacccggtg 


tggctgcccc 


atcggcctgg 


agctgctgag 


tgacatgaag 


28080 


acctgcatcg 


tcrcctaacTQC 


cttcttggtc 


ttcaccagca 


gagccgccat 


ccacaggatc 


28140 


tccctcgaga 


ccaataacaa 


cgacqtagcc 


atcccgctca 


cgggcgtcaa 


ggaggcctca 


28200 


gccctggact 


ttgatgtgtc 


caacaaccac 


atctactgga 


cagacgtcag 


cctgaaggta 


28260 


gcgtgggcca 


gaacgtgcac 


acaggcagcc 


tttatgggaa 


aaccttgcct 


ctgttcctgc 


28320 


ctcaaaggct 


tcagacactt 


ttcttaaagc 


actatcgtat 


ttattgtaac 


gcagttcaag 


28380 


ctaatcaaat 


atgagcaagc 


ctatttaaaa 


aaaaaaaaga 


tgattataat 


gagcaagtcc 


28440 


ggtagacaca 


cataagggct 


tttgtgaaat 


gcttgtgtga 


atgtgaaata 


tttgttgtcc 


28500 


gttgagcttg 


acttcagaca 


ccccacccac 


tcccttgtcg 


gtgcccgttt 


gctcagcaga 


28560 


ctctttcttc 


atttatagtg 


caaatgtaaa 


catccaggac 


aaatacagga 


agactttttt 


28620 


tttttttttt 


tgagacagag 


tcttactctg 


ttgcccaggc 


tggagtaccg 


tagcgtgagc 


28680 
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tcagctcact 


gcaacctccg 


cctcccaggt 


tcaagcgatt 


cttctgcctc 


agcctcctga 


28740 


gtagctggga 


ctacagacat 


gcaccaccac 


acccagctaa 


ttttttttat 


atttttagta 


28800 


gagacagggt 


ttcatcatgt 


tggccaggct 


ggtcttgaac 


tcctgacctc 


aggggaacag 


28860 


acggggttgg 


cctcccaaag 


ggcggaaata 


acaggggtga 


gccaccgttc 


ccggcctagg 


28920 


aaaacttttt 


gccttctaaa 


gaagagttta 


gcaaactagt 


ctgtgggctg gccttctgat 


28980 


tctgtaaaga 


aagtttgatt 


ggtggctggg 


tgcggtggct 


cacacctgta 


atcccagcac 


29040 


tttgggaggc 


cgaggtgggc 


agatcacctg 


aggtcgggag 


ttcgagacca 


gcctcaccaa 


29100 


cgtggagaaa 


ccccgtctct 


actaaaaata 


caaaaaaaaa 


attaaccggg 


catggcggcg 


29160 


cctgcctgta 


atcgcagcta 


ctcaggaggc 


tgaagcagga 


gaattgcttg 


aacctgggag 


29220 


gcggaggttg 


tggtgagctg 


agatggcacc 


attgcactcc 


agcctgggca 


acaaaagtga 


29280 


aactccgtct 


cagaaaaaaa 


aaagtttgat 


tggtgtaacc 


aaagcgcatt 


tgtttatgga 


29340 


ttgtctgtgg 


cagcttttgt 


tctgccgaga 


tgagttgtga 


cagatctgta 


tgggctctaa 


29400 


agcctaaaac 


atgtgccatc 


cgccccttta 


cagaaaaagt 


gtgctgacct 


ctgttctaaa 


29460 


gtattggaca 


actacaatgt 


ttgctcattt 


attattctat 


gatttgtttt 


ctgctttttg 


29520 


ttgttgttgt 


tgttgttgag 


atagggtttc 


cctctgtcac 


tcaggctgga 


gtgcagtggt 


29580 


gtaatttcag 


ctcactgcag 


cctcgacctc 


ctgggctcta 


gtgatcctct 


catctcagcc 


29640 


tccctagtag 


ctgggactac 


aggcacacac 


caccactcct 


ggctgatttt 


tttttttttt 


29700 


tttttttttt 


gtggagacag 


ggtttccgca 


tgttgcccag 


gctggtttca 


aactcctagg 


29760 


ctcaaacacc 


cacctcagcc 


tcccaaagtg 


ctgggattac 


aqqcqtqaqc 

zj zj zj Zj zj 


caccatgccc 


29820 


agcctattct 


actgtttgta 


ttacatagct 


ttaaaagatt 


ttttatgact 


ttaagtcaca 


29880 


agggttcttt 


gtagaaaaaa 


atatatatat 


aggaaagtat 


aaaaagaaag 


taaaaattgt 


29940 


ccataacctc 


tccagccaga 


gacgaccgtt 


gctgacacct 


cagcatattg 


cctttaagtc 


30000 


ttttttctct 


aagatagcat 


ttctcttcat 


cacagtcata 


tgctacgcag 


aattctgtat 


30060 


cctgattttt 


tcacttgaca 


ttacaacagg 


tatttgatgg 


cgctgtgaca 


aactctttgg 


30120 


cacaatcttt 


taaatgtatg 


aaatactcca 


ctgcacagat 


gtttgctttt 


aggcttaact 


30180 


gttcttttat 


tttgcgtgtg 


ctggttacag 


ccgggcacag 


tggctcatgc 


ctgtaatcac 


30240 


aacactttga 


gagggtgagg 


caggaggatc 


acttgagccc 


agaagtttga 


gaccggcctg 


30300 
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ggcaacatag 


tgagacccca 


tctctacaaa 


aaactttttt 


aataagtcgg 


crcQtacftaat 

3^-3 *~ ^ 3 ^33 


30360 


gcatagctgt 


agtcccagcc 


accaaggagg 


ctgagttggg 


aggattgctt 


gaqccccacrci 

3 3 w w 3 3 


30420 


aggttgatgc 


tgcagtgacc 


tgagattact 


ccactgtact 


ccaacctgag 


cgacagagca 


30480 


agacttgtct 


ggggaaaaaa 


aaaaaaaaaa 


tatatatata 


tatatatata 


tatatacata 


30540 


tatacataca 


cgcacacaca 


cataatataa 


aaatatatat 


ttataaatat 


ataatatata 


30600 


atataaaaat 


atatatttat 


aaataaaatt 


tataaatfcat 


atttataagt 


aaatatataa 


30660 


tatataatat 


aaaaatatat 


attatataat 


atataataaa 


atatataata 


taaaaatata 


30720 


tatttataaa 


taatatataa 


tacatactta 


taagtatata 


tttaaaatat 


atgtaatgta 


30780 


tattttttaa 


tgtatgatat 


ataatataca 


tttataaata 


cacatttata 


ttattttata 


30840 


taaaatatat 


ataaaatctc 


caagttgctt 


tttccaaaaa 


ggtgtcttgc 


tgcatttcaa 


30900 


acattcattt 


aaaaacttga 


atgctggtga 


tctggtccag 


aatgtgttca 


gtagctgctg 


30960 


ccagtggcca 


agcatctcgg 
^ 33 


gagatgtcta 


caaaacacgc 


tggttctggc 

3 3 ^33 


ctggcgtggt 

3 3 3 3 3 


31020 


ggctcacgcc 


tgtaatctca 


gcactttggg 


aggctgaggc 


aggtggatca 


actgaggtct 


31080 


ggatttcgag 


accagccttg 


ccagcttggt 


gaaaccccat 


ctctactaat 


aatacaaaaa 


31140 


aattagccag 


gcgtggtggc 


atgtgcctgt 


aatcccacct 


acttgggagg 


ctaaggctgg 

3 3 3 3 


31200 


agaatcgctt 


gaacccaggg 


qqcaaacrqtt 

3 3 3 _J 3 


gcagtgagcc 


gagatcgcac 


cattgcactc 


31260 


caggctgggc 


aagaagagcg 
— i — > i — > 


aaactccgtc 


tcaaaaaaaa 


aaaaaaagat 


gctggttcct 


31320 


aaaatgtggc 


ccttttcctc 


ctcacctgct 


gccagaccat 


cagccgcgcc 


ttcatgaacg 


31380 


ggagctcggt 


ggagcacgtg 


gtggagtttg 


gccttgacta 


ccccgagggc 


atggccgttg 


31440 


actggatggg 


caagaacctc 


tactgggccg 


acactgggac 


caacagaatc 


gaagtggcgc 


31500 


ggctggacgg 


gcagttccgg 


caagtcctcg 


tgtggaggga 

mJ —i ' t > -J 


cttggacaac 


ccgaggtcgc 

3 33 ^3 


31560 


tggccctgga 


tcccaccaag 


gggtaagtgt 


ttgcctgtcc 


cgtgcgtcct 


tgtgttcacc 


31620 


tcgtatgaga 


cagtgcgggg 


gtgccaactg 


ggcaaggtgg 


caggctgtcc 


gtgtggccct 

3 3 3 3 


31680 


cagtgattag 


agctgtactg 


atgtcattag 


ccttgatggt 


ggccaggact 


ggtagggccc 


31740 


tcagaggtca 


tggagttcct 


tcgtggagcg 


ggtgctgagg 


ctgtatcagg 


cacagtgctg 


31800 


gctgctttca 


cctgggccgt 


ctcaccgaag 


tgtccatgga 


gcctgcgtag 


ggtgggtatc 


31860 


tgtgtcgatt 


ttacagatgc 


agaaacaggc 


tcagagaaac 


cgagtgactt 


ccctaaggtc 


31920 
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acatacccag 


ttagagcaga 


gctgggccag 


gaagtgctgt 


ctcaggctcc 


tgaccaggtc 


31980 


tccttgcttt 


gcactcttgc 


caaaaccatg 


atccagaact 


gactttgagg 


tccccggacc 


32040 


tcaggctcct 


ccgaaatggc 


ctcttggagg 


ctgctgagcc 


acagcttagg 


acccacctcg 


32100 


agaggcaaat 


gtgctttgag 


ctgccaggcg 


tcctgggggc 


cctgccttgg 


gcacggggtt 


32160 


cagacaggcc 


ccagatgtgt 


ggggcgtctt 


tctggacttg 


agttttcttt 


tctgtgtggt 


32220 


ggacacagtg 


ctcacccctt 


aaagcacctg 


tgatgtgtgc 


agcagcccaa 


tccctgcctg 


32280 


tcgcctgttc 


tgctagggaa 


ggaaggaata 

3-3 ZJ ZJ 


cttcaggatg 


gcaggacaac 


agaaagaggt 


32340 


ccaggtttta 


gagcaagggc 


aggtcaaact 


tagaaaattc 


tggaatgagg 

— ' — ' —i ~-> ZJ 


atgtgcattt 


32400 


cctcttctgg 


atctgctaaa 


agaagaggga 


a gg a gggg ct 


gctgggggag 


gagcccagag 


32460 


ccgagtttac 


atccggatcc 


cgcaaggcct 


cccctgccct 


gaggtcttgt 


tttgtgatgt 


32520 


gcttgtgtcc 


atcctggttt 


ctgccgtgtc 


cccaacatcc 


ggccaagctt 


aggtggatgt 


32580 


tccagcacac 


actcaccctg 


tctgtgcacc 


tgtttttgtg 


tccgtaagtg 


ggtatttact 


32640 


caccttacga 


gtgagccact 


gtqqqaattc 

ZJ ZJ ZJ ZJ 


aqqqaqqtqq 

zj zj zj zj zj zj zj 


cgcagtgacc 


acccctggag 


32700 


ggatatgtgt 


gtggcagggg 


tcgagggtct 


cgcccttccc 


tgcttcctgc 


gcgtggcttt 

ZJ ZJ ZJ ZJ 


32760 


ctccaggacg 


gggagggctg 


agctgaagag 


gtggggacag 


ttgcgtcccc 


ccgccaccca 


32820 


ctgtcctgcg 


gtgagagcag 


actcactgag 


cctgcccttc 


tcccttgtgc 


cttccagcta 


32880 


catctactgg 


accgagtggg 

ZJ ZJ ZJ ZJ ZJ 


gcggcaagcc 


gaggatcgtg 

ZJ ZJ ZJ ZJ ZJ 


cgggccttca 


tggacgggac 

ZJ ZJ ZJ ZJ ZJ 


32940 


caactgcatg 


accrctqqtqq 


acaaqqtqqq 


ccgggccaac 


gacctcacca 


ttgactacgc 


33000 


tgaccagcgc 


ctctactgga 


ccgacctgga 


caccaacatg 


atcgagtcgt 


ccaacatgct 


33060 


999tgagggc 


cgggctgggg 


ccttctggtc 


atggagggcg 


gggcagccgg 


gcgttggcca 


33120 


cctcccagcc 


tcgccgcacg 


taccctgtgg 


cctgcaagtt 


ccccaacctg 


gcaggagctg 


33180 


tggccacacc 


cacgactgcc 


cagcagcctc 


accctctgct 

ZJ 


gtgggagttg 

ZJ ZJ ZJ ZJ ZJ ZJ 


tccccgtcca 


33240 


cccctgqgtg 

ZJZJZJ 


cctttgctgc 


agttatgtcg 


qqaqaqqctc 

^jzj^zj^ZJZJ^* 


tqqtqacaqc 

ZJ ZJ — 3 


tgtttcctgt 


33300 


gcacctgctg 


ggcactaggt 


cccagctaat 


ccctgtgcca 


ggactctaat 


ttcaccctaa 


33360 


cacacatggt 


ggttttcatt 


gctggggaag 


ctgaggcctg 


agcacatgac 


ttgccttagg 


33420 


tcacatagct 


ggtg a gttca 


ggatccccca 


gagataccag 


ggccagcact 


cgatccccac 


33480 


ccagccctga 


accccaccat 


gtgctgggat 


tgtgctggga 


gtgtccacac 


gcctgggacc 


33540 
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ccagggctgg 


tgctctcatc 


tcctttttcc 


agatcatgag 


aatgaggctc 


a gggaagttt 


33600 


gaaaaaaacc 


tatcccaagt 


cacacagcaa 


caggagcagg 


atttgaaccc 


agaaaagggg 


33660 


accgcacact 


ctgttctgct 


agagtagtta 


gctgtcctgg 


gtgatatggc 


aggtgacagg 


33720 


qqcaactqtq 

ZD ZD w »— j 


cttaacaaag 


gaacccccat 


cccccctgcc 


aagttgggag 


actagaaggt 


33780 


caggggcaga 


agcfcctgaag 


ggccaggtgc 


agtggctgac 


acctctaatc 


ccagcacttt 


33840 


gtgaggccaa 


ggcgggcaga 


tgatttgagc 


ccaggagttc 


aagatcagcc 


tgggtaatgt 


33900 


agtgagacgc 


catctctaca 


aaaaaatttt 


ttaaaaatta 


gctgggcatg 


gtggttcatg 


33960 


cctgtagtcc 


aagctacttg 


ggaggctcag 

— ' ZZJ — J — 1 — * 


gtgggaggat 


tgcttgagcc 


caggaggttg 


34020 


aggttgtggt 


gagctgtgat 


catgccactg 


cactccagcc 


tgggcaatag 


agtgagaccg 


34080 


tctccaaaaa 


aaaaaaaaga 


agaagaaaaa 


gaagctctga 


ggctccaagt 


ccccaggcac 


34140 


cccttggctt 


gagggcagac 


aagggaggag 


agggtcacct 


gggcagccct 


gacttttgtc 


34200 


ccctggcaaa 


gggaccttca 


gtgaccttgg 


ccctaggaga 


gcctctgagc 


acgtcagcca 


34260 


fcgtcgaaccg 


ctcaggaagg 


gcagcaagaa 


tttggcttct 


gacctctgcc 


tctcctactc 


34320 


gccatctgca 


ctqqqtqtqq 

ZD ZD ZD *"* ZD ZD ZD 


ttgtgcccat 


tttacagatg 


aggaggctgg 

ZD ZD ZZJ ZZJ ZZJ ZZJ 


ggcatcgacc 


34380 


agctgaatgc 


cttgtcccag 


gtactgcgta 


ggcagagctg 


gcagttgaac 


cccgtgtcct 


34440 


ggttgtcgct 


gggggtgggc 


tgcaccctga 


cttgtgaggc 


cagtagcaag 


gtttgcacgt 


34500 


gacttcgtga 


ccgtcaccca 


gctctgcagc 


acatcccgtg 


acccagctca 


tccaggccgc 


34560 


atgcaaacct 


gttgccaggc 


gagaaaccag 


tcaccgcaca 


gctgtggttg 


cctgaaatga 


34620 


ttaagctcat 


taatcacccc 


ggagtgagga 


cagactcaga 


tgaaaaccag 


caaaagccct 


34680 


ggaaactcat 


gtgaccctgc 


caatqaggqc 

ZZJ ZZJ ZD ZD 


qqccatqtqc 

ZD ZZJ ZZJ '-'ZD 


attgcagcct 


ggccgtcact 


34740 


cctcggtacg 


tgttttggac ' 


ttaaacgctc 


cggatgttta 


ctgagtgctt 


gattaataac 


34800 


atggaaggcc 


tggtctcatt 


gctgtgggag 


tgaaggatgc 


acagccaggc 


ctgacatgat 


34860 


gagaacaaga 


acctggagtc 


tcgctgcctg 


ggtggtaatc 


ctggccctgc 


cacttagcaa 


34920 


ctgtgtgact 


gtagccaggt 


cacttaattt 
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cataggcaca tagcaaaccg cacacacagt cgtgggtatg agtttcacta cattccacca 

ccagtgttca ctaccattac ctgccttccg tcttaagtgt tcatcattta aaaataaatt 

tattgggctg gacgcggtgg ctcatgactg ttatcccagc actttgggag gctgaggcgg 

gcagatcacc tgaggtcagg agttcaagac cagcctggcc aatatggtga aactccatct 

ctactaaaaa tacaaaatta gctgggcatg gtggggcatg cctataatcc cagctactca 

ggaggctgag gcaggagaat ggcgtgaacc cgagaggcag agcttacagt gagcccagat 

agcaccactg cagtccagcg tgggcaacag tgcgagactc catctcaaaa aaaaaataaa 

taaataaaag aaaaataaat ttatgatcta tttcaaaaat aacacatgta ctttgaaaca 

gcagagacac atatgacacg gagaatgaaa ttccccatag cgcaccccca agagacagcc 

ctggtccccc cgtctttccc gtggacctcc agcggggcag atgctgagcc gcctgttgtc 

gagtggcatg ctatcccgtc ctccagctcc tctgtggctt acagacaccc acctgcagcc 

ctgtctttgc ctcctctagc gcccaccacc ttcttgctgt tcagccagaa atctgccatc 

agtcggatga tcccggacga ccagcacagc ccggatctca tcctgcccct gcatggactg 

aggaacgtca aagccatcga ctatgaccca ctggacaagt tcatctactg ggtggatggg 

cgccagaaca tcaagcgagc caaggacgac gggacccagg caggtgccct gtgggaaggg 

tgcggggtgt gcttcccaag gcgctcctct tgctggtttc caggctgctg cccctgtcct 

tagcagaggg aggaaacaga ggatggctct gggtgaatga tgacttgggc ttcgattatg 

tagtcacagg gtatgaccct gagatgcgtg gaaccccgag actgtgatta tatgtagaaa 

ctgggtttcc ccgttgttta agtagtcatg gtggggtcag accccacagg acttttgtct 

tttcaagaaa gaaaatggtc gtgtgtcatg caggggtagt tggtactggt taatccaggt 

ttatccttta ttttgtggga actgtacagt catttctgct acaatgctgt atatgctctt 

ctgaaagaca cctatgcaaa atcgcacagt aaaaatgaca caactcatag ggaaagcggg 

gccagggcac agccctcaaa atctccatca atgacatgta agaaaagaga ggaacctggg 

aaatagcaaa gtgccttttg cacattaaat ggttagctat atcccacaat actgtgcatt 

cgtaaacgtt aatgctgcaa taaatacggc acttcacctt gggaagatct ggagttggct 

tatgagtgtg gaagggtgta gcgcatgagt ttttgtgaaa cactggaagg aggattgtgg 

gaaatcaaat ggaaagttct caccccaggc gtggagaaga gtgggtcatg gccccagcag 



36840 
36900 
36960 
37020 
37080 
37140 
37200 
37260 
37320 
37380 
37440 
37500 
37560 
37620 
37680 
37740 
37800 
37860 
37920 
37980 
38040 
38100 
38160 
38220 
38280 
38340 
38400 



BNSDOCID: <WO. 



0177327A1_I_> 




WO 01/77327 PCT/US00/16951 

124 



tgagcccagg 


gaggtcagag 


acggaggtgt 


gtgtgtgggt 

— ' zj zj Zj ZJ Zj 


gtgaccctgc 


gcagttccct 


38460 


gccggctgta 

ZJ ZJ ZJ ZJ 


gttttttgca 


ttcgcttaat 


gtttctcgtg 


gaggaaattg 


tgeatgagea 


3 8520 


aatgtgaaac 


cqtqctqtqc 

13 _3 ^ ' Zj ZJ 


tcaaattgtc 


ctaatacatc 


attgeattgg 


aacagattgg 


38580 


ctttnttttt 


tttttttttt 


tttttttttt 


tttqaaatqq 


agtctcactc 


tgtcaccagc 


3 86.4 0 


ctggagtgca 


gtggcatgat 


cttggctcac 


tgcaaccttt 


gcctcctatg 


ttcaagtgat 


38700 


tttcctgcct 


cagcctcctg 


agtaactggg 


attacagggc 


atgagccacc 


QCQQCCQQCC 


38760 


agatttgcat 


ttttgaaaca 


actgctaggc 


tqqqcqcqcrt 

w 333 v '3^33 ^ 


ggctcacacc 


tgtaatccca 


38820 


cr c a c t cr t acrq 


aggc cgaggc 


aggtggatca 


cetera aaf~ 


crcr crcr t f~ n era cr 

yyyy L-^-y ^y 


c*. v*» w c*.y w u» y y 


JOOOU 


ccaacatggt 


gaaaccccgt 


ctctactgaa 


tatacaaaaa 


t cacrc t acr crt 


cr tcrcrtt era c: crcr 
y ^yy ^yy^yy 


3 8940 


gtgcctgtaa 


tcccagctac 


tcaqqaqqct 


qaqqcaqqaa 


aattgcttga 


acccaaaaaa 


39000 


cagaggttgc 


qqtqaqccqa 

Z3 ZJ Zj ZJ ZJ 


gatcacacca 


ttgcactcca 


Qcctcrcrcicaa 


caaaacrcaaa 


39060 


actccatctc 


aaaaaataaa 


aaa t agaaaa 


acaagtgctg 


tageggaagt 


gagcactttg 


39120 


cggagtcagg 


cttctataac 


ctattccaca 






r«t* r*a crcr r^cr* a 


•jQi on 

J 3 i.O U 


c c t crcr acr t c t 


ocaacataao 


crcacaacaaa 


t teat t acr fcer 




yyaL.ayyi^L u 


o z7 ^± u 


arcrc t c c t aaor 


cagccttct t 


ttacaaaaac 


tcrcacracrccc 


crn c f* crt" a t" ccr 


i— CLy ua.u t_ <_ l. y 


-J 3/ J W w 


qqaqqccqaa 


qt qqq t qqa t 

zj *~ zy ZD zj "jj*** 


cacgaggtca 


ggagttcaag 


ac c acre c t acr 


ccaacatoat 

t-A- *w Wi Ik* w 


39360 


gaaaccccat 


ctctactaaa 


tatacgaaaa 


ttaqctqcrqt 

w ^^3^ ^■zjzJzj ^ 


crtaataocac 


acacctataa 


39420 


tcccagctac 


tccrcrcfaqqct 


Qaqcrcaqaat 


tcrct taaacc 


toaoa crcr t* crcr 

u yyy ^yy L yy 


aaattacaaa 

cxy y y Vrfay y 


3 94 80 


gatctgagac 


catgtcattg 


cactccagcc 


taaacaacaa 


«y ay ci^y >w 


w CL I— 1 — • CI d d d 




aaaaaaaacc 


tacagagcca 


cacggcctct 


tt ctccaccg 


aatattaata 
a z) '-•y ^ ^yy *-y 


*— y y y "-^3 ^-y 


39600 


tgttattgtg 


gtgaaatctt 


ggtactttct 


tcraqQcacraa 


a era acre t aaa 


cacctaaaaa 


39660 


gactttcaca 


tqqqtcqcca 


tqtccqccqrt 


c era 1 1 1 cere t 


at tat act cc 


ccatctgaag 


39720 


qctqqtqccq 


tccaqacaqq 


ctggacgccc 


ctttccacca 


aatccttcct 

w \«» w l«* i-* t*» 


cccacaacaa 


39780 


tttctagtta 


cgttgtactg 


tgaggtctgt 


gtccttggtt 


gatggcaaaa 


gtcagccgaa 


39840 


ttgaaattca 


gagccatgcc 


tggctccctg 


gagcttctct 


cctgggcagc 


tgtgatcatt 


39900 


gcctctgctg 


tggtgtgggt 


ggtggaaatg 


gattcctttc 


atettgettg 


ctacaggtga 


39960 


ctgtcacgtg 


gagtcctttg 


gagagaggga 


cgtgttaatt 


gatggatgtg 


gctcccatgc 


40020 



BNSDOCID: <WO 0177327A1_I_> 




WO 01/77327 PCT/US00/ 16951 

125 



tgagaaagct 


cctgggcgta 


cattgectta 


gagtttcatt 


qqaqctqcqt 


tcttttatgg 


40080 


tgtctgctag 


gcagaagtqa 


tgaagacttg 


gaagaaaacc 


cagaaggttt 


tccacttaat 


40140 


ttggaaaatg 


tgcttttccc 


ctcctgtgtc 


fcfc ttgctaag 


gtccagcctc 


ctgcagcctc 


40200 


cccgctctgt 


qqactctaac 


tttgattctt 


tattaggagt 


ccccctgct c 


ccccaaaaga 


40260 


tggtgtcfcaa 


attatcatcc 


aa t fc crcf c c era 


ggttttgttt 


tctattaatt 


gtttttattt 


4 032 0 


tttattgtgg 


taaatttata 


taacataaaa 


tttgccattt 


taattgtttt 


attattatta 


40380 


tttttgagac 


agggtctcac 


cccagtgccc 


acrcrc tcrcracrt 


crcaataatac 


gatcatggct 


40440 


cactgcagcc 


tcagcctcca 


gqcrctccacrt 


gatcctctca 


cctcaacctc 


t c taataace 


40500 


aaaactacaa 


acatacacta 


ccacatctaa 


ctaatttttt 


atattttttt 

t_> d W> i» 1» V» l» b> w- 


tttattataa 


4 0560 


acraccccrcta 


tottacccaa 


actaatc t ca 


actcctaaac 


tcaaorcatc 


ct eeeaeot e 


4 0620 


accctcccaa 


aatacfcaaaa 


ttacaaacat 


gagccacaac 


acrcaaccat 


tttaattttt 


40680 


tttttttttt 


ttaaaataaa 


gtctcactct 


atcgcccagg 


ctggagtgca 


qtaqcqtqat 


40740 


atcaactcac 


tgcaacctct 


cr c c t c c c acrcr 


ttcaagegae 


tchcctacct 


cagcctcctc 


408OO 


c caaat acre t 


aaaattacaa 


ataeccatca 

w 7j w tA. w u* 


eta tercet era 


ctaattttta 


tattttttaa 


40860 




atttcaccat 


cr t tcrcrcea crar 


ctcfatctfcoa 


Q. w l_ ^ — l_- CJ. CI V— ^ 




40QOQ 


cccgcctcgg 


cctcccaaaa 


tcrctcracratt 


acaaatataa 


accaccatac 


ccaacct ttt 


40980 


tttgtttttg 


aaacacrcrcrtc 


ttgccctgtc 


acccagactg 


qaqtqcaatq 


ataaactctt 


41040 


ggctcactgc 


agcctccgcc 


tcccaggctc 


aaattataca 


cctccacacc 


tqqetaacta 


41100 


tattttatgt 


agagacagat 


ttcaccatgt 


tgcccaggct 


gggcttgaaa 


tggactcaag 


41160 


cagtccaccc 


acc t cage ct 


cccaaagtgc 


tgagattaca 




accgcaccca 


41220 


geccatttta 


cchattctac 


agttgacagt 


tcaatacrr at 


tcaatcaatt 


cacaaaataa 


412 80 


ccatcactgc 


cattcatctc 


cagactactt 


caccttctcg 


gcagatgtcc 


gaaactgtcc 


41340 


gcattgaaca 


cactcctcat 


ctccctctga 


cagccaccat 


tctactttgt 


atctctctct 


41400 


gccttctcta 


ggtacctcat 


gtaagtggaa 


ttataccaat 


atttgecett 


gtgtgactgg 


41460 


cttctttcat 


gtgacatggt 


gtcctcaagg 


ttcatctgtg 


ttatagcctg 


tgtcagaatt 


41520 


tccttcctta 


aagcctgaat 


aataacccgt 


tgtaaaggct 


gggcgcggtg 


gctcacaccc 


41580 


tctaatccca 


gcattttggg 


agtccgaggt 


gggcagatca 


cttgaggtca 


ggagtttgag 


41640 



BNSDOCID: <WO 0177327A1_I_> 





WO 01/77327 



PCT/US00/16951 



126 



accagcctgg ccaacatagt gaaaccctgg ctctactaaa agtacaaaat tagctgggtg 
tggtggcgcg cacctgtaat cccagttact caggaggctg aggcaggaga atcgcttgta 
cccgggaggc agaggttgca atgaaccaag attgtgcctc tgcagtccag cctgggtaac 
agagtgagac ttcctgtctc aaaaaaaaaa aaaatcatcg gatggatgga cggaccactt 
cttgttattt atccatccac gggtgctagg tttcttccac ctttggttgt cgtgaataag 
gccactatga acatttcctt ccgtggtgaa ggttttgtac tagtgaggaa aaggcgtgtt 
tgtggtgttg cataggattc tggtaagaaa gtttgcacta accataagta tttgtactac 
attaaaatga aagctcaggg gccgggcgcg gtggctcacg cctgtaatcc cagcactttg 
ggaggccagg gcgggcggat catgaggtca ggagatcaag accatcctgg ccaacatggt 
gaaaccccgt ctctactaaa aataccaaaa aactagccag gtgtggtggc gggcacctgt 
agtcccagct acttgggagg ctgaggcagg agaatggcgt gaacccggga ggcggagctt 
gcggtgagcc gagatcgctt cactgcactc gagcctgggc aacagagcaa gactccgtct 
cacgcaaaac tctgtctcac gcaagactcc gtctcaaaaa aaaaaagagt tcagggttta 
tgaaactggc cagccgcgta aagtttgctg tgttgttttt gtgcccggga ggagtgtggc 
cagggtgtca cgtcacacag tacacgtttc tcagatggtg gttctccaga ctgctgtccc 
aaagtctgtt tttgcatctg gttcccacag acccaccctc cacggtgagc ctgattttgg 
ccagggtagc tggaatcttg cttgtctttc agcccggcag ctgtaccagt ccagggtcca 
cagctagtgg cttttaggaa ggaatttgtt cagttggctt tgacacatgg ccccctaggg 
tccacagctc tgtagtgatg tggatgttgt tatctacaaa gacacatgat ccttcgtgtc 
cagatgaaag tgatgatgtc tttgcagctg cccagcaagg ctgtgtgtgt gtgtgtgtgt 
gtgtgtgtgt gtgtgtgtgg tgtgtgtgtg gtgtgtgtgt gtgtatgggg gagggaggca 
ccctttccat ctgggggtgt gtgtgtgtgg ggtgtgtgtg tgtgtgtgcg cgtgtgtgtg 
gtgtgtggtg tgtgtgtgtg tatgggggag gcaccctttc catctgggtc caagagactg 
ggcctgggga agacgcttct ttttatctac ttagagactt tgttttattt gtattttttt 
gagacagggt ctcactctgt cacccaggct ggggtatggt gatatgagca tagctcactg 
cagcctcggc ctcccaggct gaagcgatcc tcccacctca gccttctgaa tagctgggac 
tgtaggcgtg cgtcaccata ctgagctatt gttttttttg tttggttggt ttaatttttt 
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ctctctgcct cagcagtggg ggctccctgc gtcagtccct cccgcacctt gagtctctct 
gatttgcttc taaagggccc tgtgattcgg ctcagccacc tttagattag gttagcctcc 
cctttgatag actccaagtc ggctgattaa taaccttact cacatctgca gaatcccttc 
tgccacataa ggtcatgacg ccgtgctggg gactggggtg ggaaattacg gggtcattta 
ggattctgcc tgccactgcc ttgctgtgtc ccagggcttg ggggaggggc ctccacagct 
gggaccacag tccttcctcc cctccatggt aaccatctga ggattacttg agaccagcct 
gggcaacatg gtgagaaccc atccctacaa aaaatacaaa caaaaaggga ccaggctggg 
cttggtggct catgcctata atcccagcac tttgggagac caaggtgggc tgatcacttg 
a ggttgggag ttcgagacca gcctgcccaa catagtgaaa tcccgtctct actaaaaata 
caaaaattag ctgggtgtgg tggcaggcgc ctgtattccc agctactggg gaggctgagg 
tgggagaatt acttgaacct gggaggcgga agttgcagtg agccaaaatt acgccactgc 
actccagcct aggcaataga gtgagactcc gtctcaaaaa aaaaaaaggg ccaggggtgg 
tagtgacaaa gagaccctat cccaaaaaaa ccgaacactg aatccttgag actgagtaag 
gacactgtga aatttttctg ggtggggcag ggaacagagc gtcttctgtc atttcttcca 
cctgggtgtg gtcagctctc cctccaagct gcctcctctt cttctcattg tccgggtgtt 
ggacacattt ggttaactgg atagaataac gcgagttccc agggacttgg tccatttgct 
attttatttt attttatttt attttatttt atttatttat ttatttattt atttatttat 
tgagatggag tttcgttttt gtcgcccagg ctggagtgca gtggcgcgat ctcggttcac 
tgcaacctct gcctcccagg ttcaagtgat tctcctacct cagccttcca agtaactggg 
attacaggca cccaccacca taccaggcta atttttttgt atttttagta gagacgggtt 
ttcgccattt tgcccaggct ggtcttcaac tcctagcctc aggtgatcca cgcacctcgg 
cctcccaaag tgctgggatt acaggcatga gccaccacgc ctggcaccat ttgctatttt 
aattcccatg tgtattagtg tcccacggct gctgtaacaa atgaccacaa actggatggc 
ttaaagcaac agaaatggat tcccccaatg tgctggagac cagaagcctg cgaccaaact 
gttgggaggg ctgtgcttcc tctgggggct ccagggagga tctatttgtt ggcccttcca 
gtgctgtggg tgccagcgtt ccacacttgt ggatgcgccg cctcaacctc tgcccatctt 
catgtgtcca tctcctttgt gtctgcgtct ttacctcttc ttcttgtctg tgttgcctct 
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tgatggacag gtacctgatt ctctgcctgc taggctgctg cctggcatcc ttttaaaatc 
acagtccctg tggcatccag tttccaaagc tgattgtgtc ttcctttgcc ctcctttctt 
ttctactatg tgcattcggt gctatgaatt ttcctctaag tactgcgttt cctgcatctc 
acaaattttg ttacattttc attttcaggt agtttgaata tttttacact tctcctgaga 
tgacatcttt ggctcatgtg ttatttagaa gtgttgctta gtttctaaag agttggggct 
tttccagctg tctctctgca actgatttct aatttaattc tactgtagtc tgagagctta 
ttttatatga tttctgttat tttaaatgtg ttgggtgtgg tgtttttgtt gttattgttt 
ttgtgtcttt ttgttttgtt ttgcttcgtt tgttttgttt ttgagacagt gtcttgctc't 
gtcactcagg ctggagtgca atggcgcgat ctcagctcac cgcaacctct gcctcccggg 
ttcaagtgat cctcttgcct cagcctcctg agtagctggg attacaggtg cacgccacca 
tacccagcta atttttgtat ttttagtaga gacggggttt caccatgttg gtcaggctgg 
tctcgaactc ctgacctcgt gatccgccca cctcggcctc ccaaagtgct gggattatag 
gcgtgagcca ctgtgcctgg ccattaggtg tgttttatca cccagcatca tgcagtttat 
cttggtgaat gttctgtgta ctcttgaaaa gaatgtggat tctgctgttg ttgggtggag 
tgttccagaa acatcaatta gatccagttg gttaatagtg ctcatcaggt tgtctctatc 
cttccttcct gactgcctgc ttgagctgtc agttattgac aggggtgtgg agtctccaac 
tctaatggtg gatttgttta tttctcctag tagttctatc tttttctctc cttctaccct 
tgatcctctt ctccccctag ggcttcctgg tgttggtggt gggagagtgg ggtagtgaag 
aacctggact ttagggccaa agaggccagg gttcaaatcc tggctctgtc acttcccagt 
tgagtgaccc tggctggtgc ctgaatctct gtgagcctcc acttcctcct ctgtgaaatt 
gagagcactt acctggcagg ctgtcatggg catcaagtaa cagggcactc cacctggacc 
ctgacacgtg atgcacagga atgccagctg ctatgccatg ggtgtggcag tagtaataaa 
gtgaccatct gtatcctcac cacagtgaag cctgtccagg gctttctctc ctatgccccc 
atgcctccag gtggccttgg atcctgttgg ttctgtgctc tgctcagcga cctttctccc 
gtgggagttc ctgggggttc agcttcatcc tacagacagc agcacacact ggctgtgcac 
cctttttttt tttttttttt ttttttttga gatggagtct cgcttttttc gcgcaggctg 
aagtgcagtg gtgtgatctt ggctcactgc aacctctacc tcctgggttc aagtgatttt 
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cccatccatt acaaggacag ggtcccctta tctcccattc ccagggcagg acaccggggg 
cagggcaggt ggggaactga gcaagttctc tgggggcagg cgtggctatg gctccctctg 
ggtgggcgtc tggggagggg tggaggcagc cgtcagcgcc ctggcttgct cttcctccct 
ggccagagac tgtggccttg tgctgctccc gtgtgggctg cctgcacctc cagtgggttg 
tgctccctcc cctcccctcc cctcaagctc tgctgagcac cactgccttc cacagccccc 
actctcggga ggcgaggctc ctcgtggcca ttcctgtcct tggcacccac ccccccacca 
acctggtaga gccttgggcg gggtctgtta ctccttgcat ggcgtagacc tccccacagt 
aggcacctga cacatacctc ctggggggca ggcaggaggt gcgttgaggt ctcagccctg 
gcagtccctc ccctgcgtgg cataggcctc gccacagggt catcgagggt gggtggagac 
tgtactagac cactccccgc tggtcctaga aagggtccca tctgtctgct ctctgtttgg 
agtccagacc ttggttgctg tgccctgcat ggtgggctgg ggggcaccct ccagcctctc 
tgagtgcatg gcctctcctt gcagccatct gcctgcccaa ccagttccgg tgtgcgagcg 
gccagtgtgt cctcatcaaa cagcagtgcg actccttccc cgactgtatc gacggctccg 
acgagctcat gtgtggtgag ccagcttctg gcacggggaa ggggcgtccg ggctgggttc 
ccccaggaac gtggagttta ggggaggaga cgtgcctttc cagcggggct gggggctgtg 
tgggagactc aggcggctgg gaggctcctt gcgggaggca gggaagcctt tcccagggca 
gcggccagga ggacagactg tgagctgtgg gctcggcggc tacagagtct gcctcagtgg 
gcggggctga tggtgtccag gtgcctgcag cacgcaccca cccacgggac cttgctgagc 
agcgtctgtc aggcagcaag attacccgag ggctgcagtg gtcctgttcc ctggcagctt 
actgtctggc tgaggaggag tgatgttcac atatgcacac atgtcatgtg cacacacatg 
tacatgacaa catcccacat gctcctcaaa tagcatgacc tgtacagtca cggatatagg 
gcctagggga taggaggcca agacagtcag ggaagacttt ccagaggcag tggctcctga 
aaggctgtct gattcaggca ggaagggagc tgagttcaga taggaagtag caatgagtca 
ttgtgtctgg ggacatggcc actccttcgc tgcagaggga cctgggctga gagctcctct 
cttatggctg cagtcgggag agaagtctgt tggggggaga agggggcttc ctcaagggac 
tccctgtgcc ctttggcacc ttcgtgccag gtcaggcttg aggcctgaag gcagtggtgg 
gggccaccaa gggtcgcctc ctctgctggg caagttccca gtctgacggg cctgtgccgt 
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tcctctctcc tgctgcggtt gaaccttggc tgtttcaacc actcctgcca agtggccctc 
tgaaagggac agtccatctt ttctcagcag agggccacac tggcaaaacg gtccctggca 
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agcccacact ctctttctta acgtcctcct cctttcgttt tacgttcaca tctttaattc 
ttctgggatg taattagatt tgatgagcaa ggtgggcatc cagcttgttt cttggctgat 
ggcttatggg tggcgtgaat tagtcggggt ctatcaggag gcagaaactc tatgagaatt 
tgaacagaga aagttccgtc tacaggctta ttaccaggga ctggaatagc agaaattgaa 
cagtgagatg tacagagaac tctaagaatg caggaatagg ccaggcatgg tggctcacac 
ctgtcatccc agcactttgg gagaccaagg cgggtggatc acctgaggtc aggagttcga 
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tcctcccttt gtagggctgg ttgtgtgctg cctggacctg gggggcacgt tcacgtggtg 
aatttgtcta tttactatcc ccgctttggg gctggtgcca gcacaggccc ttgtgaaggg 
ggtgcctttg tctggagtgg gactgtggcc cctccctcag cgtggtgact tctgtgtcag 
ggcttcagca gggacgcaga gcccctgagt gttcggaaca agggcgtcat tgcaggagtt 
agactgtgtg tgatggaggg aggaggggca ggaggaaagg tcagaaggag agttcctggg 
aaggtccctg aggagcctgg tgaggtgcta actggtgtgg aggacactca gggcctgtgg 
ggacatctcc tactgctggg ggccagccac aaagggaact ggccgaagtc ctgtccccgc 
cttcacagcc cagcatctgg tcacaaggca ggtacttgga agggcgcggg cacctgggcc 
aaaagtgcct gggttccctt tgcctttcac tgagatgacc ttcggggcag gtggctgctg 
cctcccctcc tgtccccagg ttttgccaac tggccagagg aaggggtcct gggaagcagg 
ggggccagaa gccctctctg caaggaaagc ccgaggggtg tgggaggaag gaaggaatgc 
ccaggctggc gaggctctaa gtcaccctgg cttggctctc ctcagatcct gaacccgccg 
ccctccccgg ccacggaccc ctccctgtac aacatggaca tgttctactc ttcaaacatt 
ccggccactg cgagaccgta caggtaggac atcccctgca gccctccatg gccattgggt 
tcccgccagc ccgtggtgga ggggcctaat ccccatgcca ctgatgaggg gaggtattct 
gggtgctaat gggcaggtgc cgggcccagc cctgcctccc tctgctctgc caaccacact 
aggctgcctc cccagacaag ctcagcgggc actgcatgtt gggttcagaa atcagcagaa 
ctccacgttc tgagctgctc ttcaagttgc tcctatgggg gttactttta agctgggaaa 
tggctgtggc gtcgaggggc cgggggcttg ggctccagag tctgactgtg tgtttgagtc 
cggctgtgga aacctagcca ttgagatgcc ccctcttggt ggctctgtcc tcttaggatg 
ggacaagtct gtgaaggctg ctgcagcacc caccgtagac ccctaatcgt gtgacgtcac 
caggatggtc cgggctgctc acttgccaca gtggcctgtt tgagcccggg aagccaacgg 
ggctgctcag ctggacacca gccccccgag ctgcccatgt tggggtcaca ggccccacct 
ccctggttgg ggaggggcaa ctgagagtgt ggagaggtgg gacccaggtg tgctggtctc 
cgcaggggct ggatcagagc ctgggatggg cagggtgagc ctcctgacct ttaacccagt 
ggtgtcaggc aacgtggccc acccgccagc cgcaccaggc cccacccccg caggtgaagg 
ggtgggatag gctgggcctg ggccaggaca cctctggacc acgcattcct cattgcttgg 
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cctgtcggcg gaggacagct gcccgccctc gcccgccacc gagaggagct acttccatct 
cttcccgccc cctccgtccc cctgcacgga ctcatcctga cctcggccgg gccactctgg 
cttctctgtg cccctgtaaa tagttttaaa tatgaacaaa gaaaaaaata tattttatga 
tttaaaaaat aaatataatt gggattttaa aaacatgaga aatgtgaact gtgatggggt 
gggcagggct gggagaactt tgtacagtgg agaaatattt ataaacttaa ttttgtaaaa 
cagaactgcc attctttcgt gccctgtgtg catttgagtt gtgtgtcccc gtggagggaa 
tgccgacccc cggaccacca tgagagtcct cctgcacccg ggcgtccctc tgtccggctc 
ctgcagggaa gggctggggc cttgggcaga ggtggatatc tcccctggga tgcatccctg 
agctgcaggc cgggccggct ttatgtgcgt gtggcctgtg ccgtcagaaa gggccctggg 
cttcatcacg ctgttgctgt tcgtcttcct cagattctta gtcttttttt ttfcttttttt 
ttttttgaga cggagtcttt ctctgtcatc caggctggag tgcagtggta caatctcagc 
tcactgcaag ctccgactcc caggttcaag tgagtctcct gcctcagcct cccgagtagc 
tgggactaca ggtgcgcgcc accacacccg cccagctaat ttttgtattt ttagtagaga 
tggggtttca ccatgttggc caggatgatc tcgatctctt gacctcgtga tccgcccacc 
tcggcctccc aaagtgctgg gattataggc atgagccact gtacccagct gactcttagt 
cacttttaag aaggggactg tgccttcatt tttcactggg ccctgcagaa tatatgcctg 
ggctctgggc tcttctgaac ctgtgttggc ttccatctga cctctctgtg ccagcccaag 
gctgctgctc ttcctgaggg caaggagccc catgactgcg tgttgactcg ctggatgggg 
ctgctgagcc cactctgcca caccacgtgc ccctggcagg gagggaatcc ctgggtcctc 
acaggaacag tcagcaagcc acacctgacg cctgctgtgg gcccatccct gcggtgctgg 
agaagacaga caaggcctgg tcactgcctc tgcagggtcc ccagtccgtg gaaggagaca 
gtaatctagg cattttcggt ggggaagctg agctgttctc gtgtcctgaa ggccaggcgg 
gaacagccgt cttcagaggg aagggagaaa atgcacatcg catcagtgga gaagggcctg 
acttccctca gcatggtgga gggaggtcag aaaacagtca agcttgttgc tgggtgacag 
tgcatttaat aatcaaaata taggctgggt acggtggctc atgcctgtaa tcccagcact 
ttgggaggct gaggcaggtg gatcacttga ggccaggagt ttgagaccgg cctggccaac 
atggcaaaac ctcaactact aaaatacaaa aactagccgg gcgtggtggt gcacgcctgt 
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ctgaacaggc cgggcacggt ggcttacgcc tgtaatccca gcactttgga aggcctgaga 8100 

tcaggagttc gagaccagcc tggccaacat ggtgaaaccc catctctact gaaaaaatat 8160 

aaaaattagc cgggcgtggt ggcgggtgcc tctaatgtca gccactcggg aggctgaggc 8220 

aggaaaatca cttgaacctg ggaggcatag gttgcagcga gccgaggttg caccactgca 82 80 

ctccagcctt ggcgacagag ccagactcca tctcaaaaaa attaaaataa caaaaacctg 8340 

aacagaccaa taacaagtaa tgcgatgaaa actgtaataa aatgtttccc aacaaagaaa 84 0 0 

gcccaggaac aaatggcttc actgctgaat tttaccaaac attttttttt ttttgagacg 8460 

gagtctcgct ctgtcgccca ggctggagtg cagtggtgta acctcggttc gctggtaact 8520 

tatgcctctc aggctgcaag tgattttcct gcttcaggcc ccccgagtgg ctggaaatta 8580 

gatggtactt gtcaaacaag gcctggctaa atttctatat ttccttcaag tagaagatgt 8 640 

gcttccaaca aaggttgggt tacggctggc ttctgaaaat cttggatttc aaggctcccc 87 00 

aaaag 8705 
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<212> DNA 

<213> Homo sapiens 
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gcagttgagc aaagagggcg attcacttga aggatgtgtc tggccctgcc taggagcccc 
ccggcacggt gctggggcct gaagctgccc tcgggtggtg gagaggaggg agcgatgaag 
tggcgtcgag ctgggcagga agggtgagcc cctgcaaggt gggcatgctg gggacgctga 
gcagcatggc cagcagctgg gtctgcagcc tggtacccgg cgggacttgt ggttggggct 
ggtttgtggc caggagaggg gctggcagga gacaaggggg actgtgaggc agctcccacc 
cagcagctga agcccaatgg cctggctgtg tggctctcag ctgcgtgcat aacctctcag 
tgcttcagtt ctctcatttg taaaatgagg aaacaaacag tgccagcctc ccagaggtgt 
catgaggatg aacgagtgac catgtagcat gggctgggtg cgtgtcacct aacatcacca 
gcctttgcaa ggagagccct gggggcctgg ctgagtattt cccttgcccg gcccacccca 
ggcctagact tgtgcctgct gcaggccctt gacccctgac cccattgcac ctgtctccac 
aggagccgag gaggtgctgc tgctggcccg gcggacggac ctacggagga tctcgctgga 
cacgccggac ttcaccgaca tcgtgctgca ggtggacgac atccggcacg ccattgccat 
cgactacgac ccgctagagg gctatgtcta ctggacagat gacgaggtgc gggccatccg 
cagggcgtac ctggacgggt ctggggcgca gacgctggtc aacaccgaga tcaacgaccc 
cgatggcatc gcggtcgact gggtggcccg aaacctctac tggaccgaca cgggcacgga 
ccgcatcgag gtgacgcgcc tcaacggcac ctcccgcaag atcctggtgt cggaggacct 
ggacgagccc cgagccatcg cactgcaccc cgtgatgggg taagacgggc gggggctggg 
gcctggagcc agggccaggc caagcacagg cgagagggag attgacctgg acctgtcatt 
ctgggacact gtcttgcatc agaacccgga ggagggcttg ttaaaacacc ggcagctggg 
ccccaccccc agagcggtga ttcaggagct ccagggcggg gctgaagact tgggtttcta 
acaagcaccc cagtggtccg gtgctgctgc tgggtccatg cgtagaaagc cctggagacc 
tggagggagc cctttgttcc cctggcttca gtttcctcat ctgtagaatg gaacggtcca 
tctgggtgat ttccaggatg acagtagtga cagtaagggc agcctctgtg acactgacca 
cagtacaggc caggcctctt tttttctttt tttttttttg agatggagtc tcactctgtc 
gcccaggctg gagtgcagtg gtgtgatctc agctcactac aacctctgcc tcctgggctc 
aagtgattct cctgcctcag cctcctgagt agctgggatt acaggtgcct gccactgtgc 
ttggctaatg tttgtatttt tggtagagat ggggtttcac cgtcttggcc aggctggtcg 
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caaactcctg acctcaggtg atccacctgc ctcagcctcc caaagtgctg ggattacagg 
catgagccac cacgcccggt caggccaggc ctcttttgaa cactttgcac accatgggtc 
ttttcatcca ggggggtagg tacagttgta cagttgagga cactgaagcc cagagaggct 
cagggacttg cccagggtca cacagcagga tgtggcaggt gtggggctgg gcctggcagc 
gtggctccag ctttccagca tagaaatctg tgaaagcaga tagtttgtcg gtcggtaggg 
gagactttct gagacccgcc ccagcggctc agagggtagt agccaggggc cttcctgggg 
gctcataacc cagaacactg aatgggaaaa ccctgatgga ggaggcgcag tggagctgtg 
ggtgccgatg ggaagtccca gaggagctgg gaggtcagta gcggtgctgc cctctgtgga 
gcacttagtg ggcaccaggt gtgtttccag gttcatggcc ctgggacctg aagctcagaa 
ggtgaagtaa cttgcccagg gcacccgtcg ggcagcggcg ggcagaggat ttgtgggctg 
tggagcctgt gctcgtggcc cagccctggg ggttgtgagt gtgctggccg gggagctttt 
cctgcaagtg gactggtgtc taggagccag catgtcaggc agcaggcagc gggagtgcag 
caggcagcgg gagcacagca ggcagagggc ggggctcgag cagccatccg tggaccctgg 
ggcacggagg catgtgggag agggctgctc catggcagtg gctgaagggc tgggttgtgc 
cccgaggagg gtggatgagg gtaagaagtg gggtccccag gggctttagc aagaggaggc 
ccaggaactg gttgccagct acagtgaagg gaacacggcc ctgaggtcag gagcttggtc 
aagtcactgt ctacatgggc ctcggtgtcc tcatctgtga aaaaggaagg gatggggaag 
ctgactccaa ggcccctcct agccctggtt tcatgagtct gaggatccca gggacatggg 
cttggcagtc tgacctgtga ggtcgtgggg tccagggagg ggcaccgagc tggaagcggg 
aggcagaggg gctggccggc tgggtcagac acagctgaag cagaggctgt gacttggggc 
ctcagaacct tcacccctga gctgccaccc caggatctgg gttccctcct tggggggccc 
cagggaacaa gtcacctgtc ctttgcatag gggagccctt cagctatgtg cagaaggttc 
tgctctgccc cttcctccct ctaggtgctc agctcctcca gcccactagt cagatgtgag 
gctgccccag accctgggca gggtcatttc tgtccactga cctttgggat gggagatgag 
ctcttggccc ctgagagtcc aagggctggt gtggtgaaac ccgcacaggg tggaagtggg 
catccctgtc ccaggggagc ccccagggac tctggtcact gggcttgccg ctggcatgct 
cagtcctcca gcacttactg acaccagcat ctactgacac caacatttac aaacaccgac 
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gtatttttag 


tagacagggg 


9000 


gtttcaccat 


gttggtcagg 


ctggtctgga 


actcctggtc 


tcacatgatc 


cacctgcctc 


9060 


ggcctcccaa 


agtgttggga 


ttacaggcgt 


gagccaccgc 


gcccagcctc 


tgttgagcat 


9120 


att-ttgaggt 


tctcttggtg 


ccagtgatat 


gtacatgtgt 


ccccatcgca 


ccatcgtcac 


9180 


ccattgaggt 


gacattggtg 


cctctcctcg 


gggtggatgc 


ctccctctgt 


ttccagcaac 


9240 


ttctgaagga ttttcctgag ctgcatcagt 


ccttgttgac 


gtcaccatcg 


gggtcacctt 


9300 


tgctctcctc 


agggctccca 


ggggaggccc 


gaatcaggca 


gcttgcaggg 


cagggcagga 


9360 


tggagaacac 


gagtgtgtgt 


ctgtgttgca ggatttcaga 


ccctgcttct 


gagcgggagg 


9420 


agtctcagca 


ccttcagggt 


ggggaaccca 


gggatggggg 


aggctgagtg gacgcccttc 


9480 


ccacgaaaac 


cctaggagct 


gcaggtgtgg 


ccatttcctg 


ctggagctcc 


ttgtaaatgt 


9540 


tttgtttttg gcaaggccca 


tgtttgcggg 


ccgctgagga 


tgatttgcct 


tcacgcatcc 


9600 


ccgctacccg 


tgggagcagg 


tcagggactc 


gcgtgtctgt 


ggcacaccag gcctgtgaca 


9660 


ggcgttgttc 


catgtactgt 


ctcagcagtg 


gttttcttga 


gacagggtct 


cgctcgctca 


9720 


cccaggcgag 


agtgcagtgg 


cgcaatcacg 


gctcgctgta 


gcctcaatct 


ccctgggctc 


9780 


aggtgatcct 


cctgcctcac 


cctctgagta 


gctgggacta 


cagacacata 


ccaccacacc 


9840 


cagctagttt 


ttgtgtattt 


tttgtggggg gagatggggt 


ttcgctgtgg 


tgcccaagct 


9900 


gatctcaaac 


tcctgaggca 


caagcgatcc 


acctgcctcg 


gcctcccaaa gtgctgggat . 


9960 


gacaggcatc 


agccgtcaca 


cgcagctcaa 


tgattttatt 


gtggtaaaat 


aaacatagca 


10020 


caaaattgat 


gattttaacc 


attttaaagt 


gaacagttca 


ggctgggcgt 


ggtggcttat 


10080 


gcttgtaatc 


ccagtacttt 


gagaggctga 


ggtgggcaga 


tcacctgagg 


tcaggagttt 


10140 


gagaccagcc 


tggccaacat 


gatgaaatcc 


agtctctact 


aaaaatacaa 


aaattagccg 


10200 
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ggcatggtgg 


caggtgcctg 


taatcccagc 


tactcgggag 


gctgaggcag 


gagaatcgct 


10260 


tgagcccggg 


aggtggaggt 


tgcagtgatc 


tgagatcatg 


ccactgcact 


ccaatctgtg 


10320 


tgacagagca 


agactctgtc 


ttgaaaaata 


aataaataaa 


aaaaatttta 


aaaagtgaac 


10380 


aattcagggc 


atttagtatg 


aggacaatgt 


ggtgcaggta 


tctctgctac 


tatctacttc 


10440 


tagaacactt 


tcttctgccc 


tgaaggaaac 


cccatgccca 


ccggcactca 


cgcccattct 


10500 


cccctctctc 


ccagcctctg 


tcaaccacta 


atctactttc 


tgtctctggg 


ggttcacttc 


10560 


ttctggacgt 


tttgtgtgac 


tggaatcctg 


caatatgtgg 


tccctgcgtg 


tggcttcttt 


1O620 


ccatagcatt 


gtgttttcca 


gattcaccca 


cacattgtcg 


cacgttatca 


gaatctcatt 


10680 


cctgactggg 


tgcagtgggt 


taggcctgta 


atcctaacat 


tctgggaggc 


caaggcggga 


10740 


cgatcacttg 


aggcaggagt 


ttgagaccag 


cctggccagc 


ctagcaagac 


cccagctacc . 


10800 


aaaaaatttt 


aaaagttaac 


tgaacgtggt 


ggtggtgggc 


acttgtggtt 


cccagctacc 


10860 


tgggaggctg 


aggtgggagg 


atcgcttaag 


cccaggaggt 


caaggctgca 


gtgagctatg 


1O920 


atcgcaccac 


tgcactccag 


cctggacaac 


agagcaagac 


cctgtctgaa 


aaaaaaaaca 


10980 


aaaaaaaaag 


ttcctttctt 


tttgtggctg 


gatgacatcc 


cattgtatgg 


ccacagcaca 


11040 


ttttgtttgt 


ctgtttatcg 


ggtggtgggc 


agtggtttcc 


accttttgtc 


tcctgtgaat 


11100 


aatgctgctg 


tgaacatttg 


aattcaagtt 


tttgtttgaa 


cacctgttgt 


gaattatttg 


11160 


gafcatatgtg 


taggggtagg 


attgctgagt 


cctatggtaa 


tgttaggttt 


gacttactga 


11220 


ggaaccatta 


aactgttttc 


aacagtggct 


gcgccgttct 


gcatccccac 


cggcagtgtg 


11280 


tgagggttct 


gactttacct 


cctcacaaac 


gcttcttttc 


catttaaaaa 


aatattcagc 


11340 


caggtgctct 


ggctcacgcc 


tgtaatccca 


gcactttggg 


aggccgtggc 


gggcggatca 


11400 


cctgaggtca 


ggagttcgag 


acgagcctgg 


ccaacatggt 


gtaaccccat 


ctctaccaaa 


11460 


aatataaaaa 


ttagccgggt 


gtggcagcgg 


gcgcctgtaa 


tcccagctac 


ttgggaggct 


11520 


gaggcaggag 


aatcacttga 


acccgggagg 


cagaggttgc 


agtgagccaa 


gatcgcgcca 


11580 


ctacactcca 


gcctgggtga 


caagagtgaa 


actccatcta 


aaataaaaca 


aaaataaaaa 


11640 


taaataaaaa 


tttattaaaa 


cattcatcac 


agccagccta 


gtgggtgtcc 


catgtggctt 


11700 


tgcctcgcat 


ttccctgata 


actaggatgc 


tgagcgtctt 


gtcccaggct 


tgccacacct 


11760 


cagcactttg 


agatacgtcg 


cacagtcccc 


atttgcgaac 


gagaaatgag 


gtttagggaa 


11820 
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cagcagctgt 


gtcatgtcac 


acagcgagca 


gqqqq fc C t C t 

zj 


qaqccqtctq 


accccacagc 


11880 


cgaccaagct 


ccaatcctta 


ccgcctccta 


gtgttgtgga 


tgtagcccag 


ggtgctccca 


11940 


catttttcag 


atgagaacac 


cgaagctcaa 


aacaggagcg 


ttttgtccac 


attggataca 


12000 


cgatgtctgt 


ggtttggtcc 


tgaagtcact 


t tat ate tea 


gtqqtccaqa 

3 3 3 3 


C tqqaqtaqq 


12060 


acagggggtt 


ctggggaatg 


gqqaaqqtqt 


ctcaqgtqaa 


aqoaacrcraat 


tccaaattct 


12120 


ccatactgtc 


cttgggaagt 


tagaagactc 


aqaqqqtcta 

3 *"*3 3 3 3 


gcaaagtcag 


acaaagcaag 


12180 


agaaatgcag 


tcaggaggaa 


gcggagctgt 


ccaggaacag 


gggggtcgea 


ggagctcacc 


12240 


cccaggaact 


acacttgctg 


gggccttcgt 


gtcacaatga 


cgtgagcact 


qcqtqttaat 

3 W 3 W 3 V w^jfc* v» 


12300 


tacccacttt 


tttttttttt 


ttgaggtgga 


gtctcgctct 


cttgcccagt 


c tqqaqtqca 

w 3 3 ^^33 3 w *" 4. 


12360 


gtggcacgat 


ctcggctcac 


tgcaagctct 


gcctcccggg ttcatgecat 


tctcctgcct 


12420 


cagcctcccg 


cgtagctggg 


actacaggcg 


cctgccaccg cgcccggcta 


atttttatat 


12480 


ttttagtaga gatgggattt 


cactacatta 


gecaggatgg 


tctcgatctc 


ctgacctcat 


12540 


gatccgcccg 


tctcggcctc 


ccaaagtgct 


gggattacag gcgtgagcca 


ccgccrccccrcr 

v - v *3 , -'3)*- , " l -3)3) 


12600 


cccgatttcc 


cactttaaga 


atctgtctgt 


acatcctcaa 


agccctatac 


acaatcrctacr 

» ^ *^3 *"*331 w 3 3 


12660 


gttgctatag ggaatatgag 


gcttacaggc 


catggtgctg gacacacaga 


a crcrcr a c crcr a cr 


12720 


gtcaggaggt 


agaagggcgg 


aqaqaqqgaa 


caggeggagg 


tcacatcctt 


aacttfccaaa 


12780 


atgggccagg 


gagagacacc 


ctctgagcat 


ggtaggacag 


gaaagcaaga 


t tggaacaca 


12840 


ttgagagcaa ccgaggtggc 


tgggcgtggt 


ggcttacgcc 


tgtaatccca 


acactttgga 


12900 


aagctgaggt 


gggtggattg 


cttgaggcca 


ggagttcaag 


accagcctgg 


ccaacatggt 


12960 


gagaccccgt 


ctctactaaa 


tatacaaaaa 


ttagecagge 


gtgatggtgc 


atacctgtaa 


13020 


tcccagctgc 


ttgggaggct 


gaqgc aqqaq 

.-3 Z3 3 3 3 


aattgcttaa 


acctgggagg 


ccfqaqcrttac 

^^3 3 3 


13080 


agtgagccga gatcccgcca 


ctgcactcca 


gcctgggcca 


cagagtgaga 


ctccatctca 


13140 


aaaaaaaaaa 


aaaaaaaaga 


taaaaagacc 


aaccgaggaa 


ttgaagtggg 


ggggegtcac 


13200 


agtagcagaa 


gggggatcgt 


ggagcaggcc 


accctgtggt 


catgeactgg 


aagctcatta 


13260 


cctgacgatt 


tggagctcat 


cactgggggc 


ctaaggagaa 


tagatactga 


aggatgagga 


13320 


gtgatggcgc 


ggggcacggg 


tgtctttggt 


ggecagaact 


tggggactgc tggggtgcct 


13380 


cactgcaggc 


cttctcagcg 


ccctttatat 


gcttacacag gctgtttcta 


agagggggat 


13440 
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acattgcata 


agcgttttca 


gactacctca 


tcatgggtcc 


ctttctttac 


cctctgtggc 


13500 


cctggtggcg 


cactctctgg 


gaaggtgcag 


gtggatgccc 


agacccgccc 


tgccatccac 


13560 


ctgcacgtcc 


agagctgact 


tagcctcgag 


attgctgctg 


gcacctcctg 


ccccgggaca 


13620 


cctcggatgt 


gcccgtggag 


atgctggctc 


tgtgttttct 


gctggagttt 


ggtgcgtctt 


13680 


ttcctcctgc 


aagtggccac 


cgctcttggg 


tatgtcctca 


ggcttctgcg 


agtcatggct 


13740 


gcttctcagg 


tccttgccca 


gcgccaggag 


caaaccctcc 


tggcactttg 


ttcaggggtg 


13800 


gatgcgccag 


tgttcctgct 


gtggaccccc 


atctcacatg 


agggtcttgg 


gcctgcaggc 


13860 


tcgttcagga 


aacacccgct 


gagtacgcag 


tgtgtgccag 


ctgtgtccca 


ggcaatggcg 


13920 


gggacagtgg 


ctgctgctgg 


ggttgtggtg 


gcttctgggg 


actctgggga 


cagctgaggt 


13980 


gcaaggagcc 


acggctcctt 


gaggatgcag 


ttggactcca 


ggtggaaggg 


atggttgggg 


14040 


gaggtataaa 


tggggtcagg 


gaggagacac 


atttggaaca 


atgggaacat 


ttttaagatg 


14100 


ctatgtcggg 


aggcaacaag 


gtggccaacc 


caggtgctga 


ggagcccaca 


ccagccctgg 


14160 


acgtgttttg 


ccgctcacct 


ttgctgggga 


gtggtgggag 


agaggattcc 


gttccacgtg 


14220 


gtggtgtgcg 


cagctgggct 


gtgtggagct 


gggcgctagg 


aggaaggtgc 


tttcfcgcggg 


14280 


gctagccggg 


ctctgccttt 


gaacacaatc 


aggctccagg 


ttttcagcat 


ccagtgcatg 


14340 


agaggacttc 


acgggcagct 


gtggctgatc 


ccttgatgaa 


ttgggagaag 


aacaaaggtc 


14400 


tatgaaatga 


ggtttcatgt 


agatggcatt 


agagacgccc 


acaacagatt 


tacagagtgg 


14460 


agcggagacg 


gcggatgggt 


ctgggaggcc 


cctcctgctg 


gccttgactg 


tgacagctgt 


14520 


cctgggaatc 


agcttccagg 


ccgccccagc 


agcctgactg 


acacacacag 


gggttttagc 


14580 


cccatcctgc 


gaccagctgt 


tgccatcatc 


agtgacagct 


gggagtggcg 


gtggttccag 


14640 


ccctgggcac 


cctccccacc 


tgctggggcc 


cacccagggc 


agtcctgaca 


cctacaggtt 


14700 


gcttggagcc 


gcatccgagt 


cctgccccac 


cacgtgtgaa 


gcccgagtgg 


tcgtgggctg 


14760 


aggtcccctg 


attgcatccc 


cacttccctt 


ctgcttcaca 


tagctgcctc 


ttctcaccgt 


14820 


ttttccagcc 


tcctgggcta 


ggaattccag 


tgttgtgctg 


gctttgcccc 


aggacacctc 


14880 


cttagccctc 


ttcctgagtc 


tagagccccg 


ggggttggaa 


gttctggccc 


ctgggacacc 


14940 


tgcagccaca 


ctcagcttct 


cctgtgagcc 


tccagcatgt 


cccctcagga 


ccaagccctc 


15000 


acgttcttgc 


ctccccgccc 


acctgggctc 


agccagggga 


aggcctggct 


gggagcgtct 


15060 
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cccctctgcc 


ctgcccttct 


cccctctacc ctgcccttct ctcctctgcc 


ccgccaftggc 


15120 


ttttatatcc 


tgtgccacaa 


gacatggctg tgtgtgaaag tggcagggtc 


tggcatctct 


15180 


gtgggtctct 


gaggcccacg 


ctccagtgcc actcttccca cccgctggcc 


gtgccctcat 


15240 


gctggaggga 


cagcccagcc 


ctctcccgaa ccccagcccc atgtgcccag 


ctgcccccgg 


15300 


ccctctcccc 


tggaagccgg 


ggtcactcca gccgtatgcc atggtgggga 


catcctgctt 


15360 


ccttggcctt 


ccagggaagg 


tcctctttcc aaatggcgac acctggtccc 


tgcctggagg 


15420 


ctggaagctg 


tggcccttgt 


atgcccctcc agggtctgtg cgctcggttg 


gcccgagttc 


15480 


ccatcaccgt 


catcatcacc 


atcatcattg tcatttcgct tgtctgtgag ccggcctggt 


15540 


ctcccagagc 


agagaccctc 


tgaggtccag cctgagttgg ggtctccgtg 


ctgacccctg 


15600 


acggggactc 


aggacgtacc 


aggtctgggt caggagtgac ccccaaacct 


cgtgcccttt 


15660 


gacaggcacc cctgactttt gctaagtggg tggaggtgac atcacttaca gcgggagtga 


15720 


tgggacaggg tctgttggct 


gcactgtgct cccagggatc tggggagagg 


ctatatccct 


15780 


gggctttggc 


actgcagagc 


tgtgtgtgtt tgtgtgtgtg tgtgtgtgtg 


t o t a t" cr 1~ a fc cr 


15840 


tgtgtgtgtg tgtgtgtgtg tttgcgtgcg cgcacatgtg tataagatct 


ttttttatta 


15900 


catgaagcaa gataactgtt 


gctgtttcct tttgggtttt gtgttcaaca 


gagtggggta 


15960 


cttcttccct 


cagacaacag 


aactctcccc tttaaacacg tgctgtcaga 


gggtgggtct 


16020 


tgggctcatg 


tctgtttgca 


cagccgagtc agaggaaaca cagggttctt 


cataaaaaca 


16080 


ctgcacagca ggcgactgtc 


cagagtcagc ctgcaggacg gcagcagccc 


tgcccctcag 


16140 


agcacagcta gggtgggctg 


ctttgggatc tcccgtcatt ccctcccagc 


tggcagccgg 


16200 


cggccggccc 


attccttggt 


gtgctggtca ggggggcgtg cgcctgctct 


gctcaccctg 


16260 


ggaatgggac 


agaagctggc 


agctcggaga ggacagggct ggacccttgg 


gtggcctctg 


16320 


gctggaccat 


ctcattgtcc 


tcagacacag cctctcgggt ctagtttcat 


ttcctgaaaa 


16380 


acaagtgcac 


agaactagag 


caggagtcga gagctacggc ccccgggcca 


gatccagccc 


16440 


tgccacctgt 


tttcacacca 


tgctcaagct gagtgggttt tacatttttt 


aattacttga 


16500 


aaaaaaaaaa 


gccaaaggag gtttcatgac ccatgaaaat tatatggaat 


tcaaaaaaaa 


16560 


aaaattatat 


ggaattcaaa 


tttcagtgtc cataaataat ttcttgagac 


agggtctcgc 


16620 


tctgtcaccc 


aggctggagt 


gcagtgctat ggcatggctc gctgtaccct 


tgacctccca 


16680 
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ggctcaagcg 


atcctcctgt 


ctcagcctcc 


tgagtagctg 


ggactacggg 


tgtgtgccac 


16740 


caagcccggc 


taattttttt 


ttaattttag 


taaagacagg 


gtctttctat 


gttgcccagg 


16800 


cttttctgga 


actccatctt 


ggcctcccaa 


agtgctggga 


ttacaggctc 


gagccacgga 


16860 


gcccagcctg 


tttttgtttt 


ttcactgata 


aagttttgcc 


gggfcgtggta 


gtgtgtgcct 


16920 


ctagcgattt 


gggaggctga 


ggtgggagga 


tcgcttaagc 


ccaggagttt 


gaggctgggc 
-j — * — i —j -j —j 


16980 


tcaagtgatc 


aggaqgtgaa 

ZJ Zj ZJ ZD ZJ 


ctatgatcat 


gtcattgcat 


tccagcctgg 


gtgacagagc 


17040 


aagaacctat 


ctcttaaaaa 


tatatattta 


aaaagtattg 


ggtgtggtgg 

ZJ ~J _* ZJ ZJ -J ZJ 


ctcacgcctg 


17100 


tggtcccagc 


tacttaggca 


tctgaggtgg 


gaggatggct 


tgagcccagg 


agtttgaggt 


17160 


tgcagcgagc 


caagatcgtg 


tcactacact 


ctagcctggg 


tgacagagcc 


cagaccctgc 


17220 


ctctttaaaa 


aaaaaaacca 


aaaaacatgt 


attggaacac 


agccatgcct 


gttcagtcac 


17280 


gtgctctcca 


fcgctgcfcttc 


tgctccagag 


acccttatgg 


cctgaaagct 


gaaaatattt 


17340 


tctatccttt 


acaaaaaagt 


ttgctgacct 


ctgtcctgga 


aaattcatct 


cccaagttct 


17400 


cttccggcac 


tggcgttcct 


gggtgtccta 


aatttggccc 


ctgttatttc 


tgaactctgt 


17460 


tttggctctg 


ttccctccca 


ggagccagga 


caggcacgtt 


ctctgcafcct 


tgtcccctga 


17520 


cgcccaqagg 


cttgqctcgq 

-3.3 Zj ZJ 


ctcaggcatzt 


cttggaaata 


tctggctcca 


ggaaaggcag 

Zj Zj ZJ J ZJ 


17580 


aggcctcctg 


agtcagccca 


gagggaacct 


gccccaggtc 


t QqQQQ3LQQC 

ZJ ZJ Zj ZJ Z3***Z2 ZJ 


ctgacccagc 


17640 


agagtqqctt 

zj zj 3 j " 


ttqccqatqq 

zj ^ zj zj Zj 


qttgqqccqq 

Zj zj zj Zj Zj Zj* 


tcaagatgtg 


ctgaaagttg 


tcctcagaag 


17700 


gccactttgg 


gattccttcc 


tccagtatta 


gagcaactga 


gagctgctca 


ttgcaagcct 


17760 


gatgttttcc 


cagttggccg 


ggtccaccgg 


gtgccctggg 


attctgggat 


ctgggtggaa 


17820 


agtagggggc 


ttgggggagt 


gtcctgggtt 


ctggaatcca 


ggtggcaagt 


ggtgaggttc 


17880 


agggagtggc 

ZJ ZJ Z) ZJ ZJ Z) 


ttctgagcca 


ccataggggt 

Zjzjzizj 


ctctgtggga 

ZJ Zj ZJ ZJ 


ggctctgccc 


atccaggaga 


17940 


ttccgcaggc 


cctgccggcc 


cagagccagc 


gtcttgcgct 


tgccgaggct 

ZJ Zj 


acagccagcc 


18000 


c'cagccgggt 

ZJ ZJ ZJ ^ ~ 


ggaacagccc 


gtcgcctcct 


ctcactttgt 


tttggggcca 

^ ZJ ZJ —J ZJ 


cctgggagtg 


18060 


tggagcaagg 


gtagagaggg 


aggaagtggc 


tgccggccgc 


tgcccagcac 


ccttgtttgc 


18120 


cttgggccct 


ctgtgggctc 


ctttttattg 


ctcttcaatg 


aagccaggga 


aatggacttc 


18180 


cttgcctcac 


ttcagttcaa 


catgtctgga 


agtttggtat 


taaaattaag 


aaagtgtgga 


18240 


aatagagcaa 


gaagagaaaa 


atctctccaa 


gagataatag 


tgacctcfcga 


gctgggcgcg 


18300 
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gtggctcacg cctgtaaatc ccagtacttt gggaggctga ggcgggcaga tcacctgagg 
tcgggagttt gtgaccggcc tgaccaagat ggagaaaccc cgtctctact aaaaataaat 
aaataaataa ataaataaat acaaaattag ccaggcatgg tggcgcctgc ctataatccc 
agctaaggca ggagaatcgc ttgaacctgg gaggcaaagg ttgcagtgag ccaagatcac 
gccattgcac tctagtctgg gcaacaagag tgaaactccg tctcaaaaaa aataaataaa 
taaaaaataa aaatagtgac ctctggccag gtgtggcagc tcatacccgt aatcccagca 
ctttggaagg aaggccgaga tgggcagatt gctttagcac aggagtttga gaccagcctg 
gccaacatgg tggaacccca tctctacaaa aatagaataa aatttaagag gtaatagtga 
ccttttggta gatcgaaacc tggattgctt tctttttcta aatgctgatt cttttctttg 
tggtgtttgt gttctgtgcc gatgtccctc ccccagccct gttattgtga gtggaagaag 
gggaaagggt tcgcccgcta ctgtgagccc ctcctctcac gctgggtgtc cttggagaag 
cctgcacttc ttcattgtac gccagggctg ggtccctccc tggagtggtt ctgtgctgct 
gggatggggc caacccctca gatgttttct gagtgtcaca cacaggtgtg tgcattcatg 
gcctttgcgt gtcttcctgt tgtggaggca aaaatgtgaa gaaccctaga tgattttggg 
accagggctc catcacctgc tgttcattgc acaccggagc atccaggcat gggtggagag 
ctcagacttc caggcacggt cgcaggggct ggtctaacca tgttcccgcc cgcctgctcg 
tcagaaccgc ctgttgggag ctgttatcat gataccatac ctgggccctg ggctatccga 
ttctgactta attgctccag gttggggcca ggccgttgtt tgctgttttg ttgtttcttc 
tgtgacgtta gccactgggc taatctgagc ccctcagtta caggtggaga aactgagacc 
catgggggtg caaggacttg ccgaggaccc agagcccctt gggggcagag ctgaggcggg 
gcctggcttt gggtcccaga gcttccagtc cccttcccgc tctcctaaca gctttttttt 
ttgagacaag atctcaccct gtcacccagg ctggagtgca atggcatgat ctcggctcac 
tgcaatcttc gctagctgcg ttccagcgat tctcctgcct cagcctcccg agcagctggg 
attacaggtg tgtgccgcca tgcccagctc gttttttttt gtacttttag tagagatagg 
gtttcaccat gttggccagg ctgatctcga actcctgacc tcaaatgatc cgcctgcctc 
ggcctcccaa agtgctagga ttacaggctg ggatcacact gtgcctggcc ctagcagctt 
tgtcctgtgc catccaacaa cagatgaccg aagtctttgt ttcttaacat gcattccatc 
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tgccttacag ttttgccacc tgcaaaacag aggacttgtc gcttttctgg taagctggaa 
atgtaatctg gtagcaggag gcctgtggaa gcttgccttt aatggccttg tgtctctttc 
atcctgtcct gagagccgga gaacttggat gttgcaccta actcaacctt cctgttaaca 
tacag.ttctg caggctcatg gatcatcaga accacgtcct atctcacgcg gctgtatgct 
tccgttggtt caggtgtttt taccttgaca gtattttctc ctcggtggct tttgcggtgg 
ttgcttttaa tcagcattga ctcttcaaga aaaatattta gctgctacat ctcagaggag 
acagggtgga aagcatctga gacctgcagg ctcagactta gaaccagaag tgccctcaga 
gttcatccgg ccctgaccca gcgggaaatg agttcacaga gaagcgggag aactttgccc 
caggccctgc cgttgctcat aactgcccca ggtccttaca tttgctccag gtcctgcccc 
aggccctgca gttgctcata actgccccag gtccttatat ttgctccagg tcctgcccca 
ggtcctgcag ttgctctgtg tggtgggtgt gatctggagc cctccgccca ttgctgcacc 
tggggcaggc attgctaatt gatcccagga ctccttcctg cggagcacgc cctggttctc 
caggcagccg ctgcctgtca gcctgcagtg gttcgggaga ggacacctgc ttgcctggtc 
tgttccaaat cttgcttctc atcccagcac aggtaggggg tgctatggga aagggatcct 
cagttggccc tgtcactgct ctatcagctg gggacgtggc atcctagtga aaacatcatg 
gccgggcgcg gtggctcacg cctggaatcc cagcactttg ggaggctgag gagggtggat 
cacttgaggt cagaagttcg agaccagcct ggtcaacatg gtgaaaccca tctctactaa 
aaatacaaaa attcgccagg tgtggtggcg ggtacctgta atccgagcta ctcgggaggc 
tgaggcagga gaatcgcttg aacctgggag gtggagcttg cagtgagccg agatcttgcc 
actgcactcc agcctgggca acagagtgag acgctgtctc aaaatctcaa acaaacaaac 
aaacaaaaaa caaacaaaca aagcgtcatt tatccagcac ccctggggaa ccatgctacc 
tggtgtttta tggtacctgg caaggtgcag gtgaagttgc tgctcttggg cattgaaccc 
gtcttgtttg gggcagctca ggccccaggc agggtccggg ttggctctcg ttggtgtggc 
cctggcccat ccagacctat atttctgccg tcctgcaggt gatcaatgtt gatgggacga 
agaggcggac cctcctggag gacaagctcc cgcacatttt cgggttcacg ctgctggggg 
acttcatcta ctggactgac tggcagcgcc gcagcatcga gcgggtgcac aaggtcaagg 
ccagccggga cgtcatcatt gaccagctgc ccgacctgat ggggctcaaa gctgtgaatg 
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tggccaaggt 


cgtcggtgag 


tccggggggt 


cccaagccat 


ggctcagcca 


tgcagacttg 


21600 


catgaggagg 


aagtgacggg 


tccatgcctg 


ggcataagtg 


ttgagctcag 


gtgccccgac 


21660 


ctggggaagg 


gcaggacagg 


aaaggtgaca 


gtatctggcc 


aaggacagat 


gggaagggac 


21720 


caagggagct 


gattagggag 


tggttatgga 


ctaggaatgt 


cggtaacaat 


ggttagaaag 


21780 


tgactaacat 


ttgttgagca 


cctgctgtgt 


gcccggccct 


ggc egg gage 


cttcgtgccc 


21840 


acagtgaccc 


cgtctgcaaa 


tgtagttcct 


tgccctactc 


qcactqqgga 


qcaqqaccrca 

ZZJ ^ ZD ZD w — ^ w w*. 


21900 


gagccgtgca 


tctcacaggt 


gccaagctca 


ggactccctc 


ctgggtctgc 

— ' i ZJ ~J 


ctgggctggg 


21960 


ctgtgcttgt 


tgcccctgtg 


gcccacgcat 


gtgcaccttc 


cacctgaaag 


ccaggatctt 


22020 


caggacgctc 


cccgaggagg 


tcgttgtctg 


gcacaatgat 


ttgtctcttc 


ctgaaaaggt 


22080 


gacagagtta 


cactggagag 


agcagcatcc 


aggtgcggca 

—J ZJ ZJ ZJ ZJ 


gggacaaqcc 

ZJ ZZ3 ZZJ ZJ ZJ 


tqqqqctcqc • 

zj zj zj zj 


22140 


gggcagggac 


tctgtgtcct 


gccggggtcc 


cacactgcac 


ctgcttgtca 


gaggcactca 


22200 


gtcaatcttt gctgatgaag gatgagagga 


cagaggacgt 


oatcrcttqet 


crctqcattac 


22260 


ctgcagtcct 


gggtgagatg 


cccgggttga 


ctctgctgcc 


c cr t c Qcrcr t qct 


atgtgatgtc 


22320 


agatccccgg 


ctttaaaata 


cgagggagct 


gggaattgag 


qqaqcaqqtt 


qqqqcaqaaa 

ZD ZD ZD ZD ZD 


22380 


gcacagcccc 


gtggaagcct 


ggagctgagg 


caqtqtqqqc 


gacccctgga 


qcaqtqaqtq 


22440 


cttccttcat 


ggccttcatc 


gcaccctgca 


gtcctcatgt 


acrcrcrcrafccfcc 


atccatgaat 


22500 


ttagttttcc 


cagcctcctt 


taaaaacgcg 


ttcatgctgg 


QQCcqcrqqca 

33 w ^3333*"* 


qfcQCaQtCTQC 

ZD ^ ZD ^"-ZJ ZJ ^ 


22560 


tcacatctga 


aatcccacca 


ctttgggagg 




tggatcatga 


qqtcaqqaqa 


22620 


tcgagaccat 


cctggctaac 


aaggtgaaac 


cccgtctcta 


ctaaaaatac 


aaaaaattag 


22680 


ccgggtgcgg 


tgg.cgggcgc 


ctgtagtccc 


agctactcgg 


qaqqctqaqq 


caggagaatg 


22740 


gcgtgaaccc 


gggaagcgga 


gcttgcagtg 


agccgagatt 


gcgccactgc 


agtccgcagt 


22800 


ccggcctggg cgacagagcg agactccgtc 


tcaaaaaaaa 


aaaaaaaagt 


acaaaaaaaa 


22860 


aaaaattagt 


ctgggtgtgg 


tatcacgcgc 


ctataatctc 


actactcgag 


a era c t qaqqc 

^ZZjZZj *—3 *«*^3^3 


22920 


ggagaattgc 


ttgaacccag gaggtagagg 


ttgtagtgag 


cccgtatcgt 


accactgccc 


22980 


tccacctggg 


caatagagcg 


agactctgtc 


tcaaaaagaa 


aaaaaaaaaa 


agaacattta 


23040 


tgccaggtgt 


ggtggctcat 


gcctgaaatc 


ccagaacttt 


ggaagactga 


ggcaggagga 


23100 


tcacttgagc ccagaaattt gagagtgtct 


tccctgggca 


acatagagag 


acctcatctc 


23160 
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taccagaaaa 


aaaaaaatta 


gcccggcatg 


gtggcatatc 


- 

cctgtggtcc 


cagctactta 


23220 


gggggctgac 


gtggcaggat 


caccfcgagfcc 


tggaggcaga 


ggttgaagtg 


agctgagatc 


23280 


atgccactgc 


actccagcct 


gggtgacaga 


cagagaccct 


gtctcaaaaa 


aaaaaaaaaa 


23340 


aaaaagcatt 


tactatccac 


catggaaggt 


gagactgacc 


tgtgagtgat 


tgttcaaaga 


23400 


acaaaaaata 


aaccccagag 


ataagacaaa 


agggtgcctc 


catgggggtg 


tgatttaaag 


23460 


ctgagaaatt 


gggcttcttc 


cccctcccct 


ctcaccccgt 


ggtttgctaa 


aggagatggg 


23520 


aaaaaggatt 


ctttttttgg 


ctgaaatatt 


taacactaaa 


ttaaagccaa 


ttttaacagc 


23580 


actttggttg 


atgagtgaaa 


ttaacagact 


ggccaaaaat 


aaacgaacgg 


tctgtactat 


23640 


gtgaaaaaga 


ggcagctttg 


gccafcgctgg 


gccaatgtga 


gttttcaggg 


ttgctgggaa 


23700 


tgtctgtgaa 


tcggaggaag 


ggcctagctg 


ggactctcag 


gagccaaggc 


cctgaggggc 


.23760 


aacttgcctg 


gtccctgccc 


tgaggcgttc 


actgctttct 


tcctgggcca 


gatcacaggc . 


23820 


ccggaggctg 


gaccactggg 


ctggcactct 


tgccgagcfcg 


ctccctgact 


tcctgaccat 


23880 


gctcctttca 


gcagccttgc 


tgcactttag 


tttccttgaa 


tgaaaaatgg 


ggatgagaat 


23940 


agctcctacc 


tccaaggtga 


atggagtgag 


ttcggacagg 


tgactccctg 


ggaccagtgc 


24000 


ctggcgcctg 


acaaggtcca 


gtcagagccc 


gcactgctgt 


tactgatacc 


cttggctgta 


24060 


ccaggggaga 


acttggttgc 


catfcgccagg 


tgttctccca 


ccacccccac 


tactgtccct 


24120 


gtttgatgtg 


tggcgggaat 


aaagctgtgc 


acattggagc 


ttttggcaca 


tcctggcttt 


24180 


caggtgaaag 


gtgcgtgtgt 


gtttgagggt 


ttagcctggc 


caacccagcc 


atgaggtcgg 


24240 


acctgacctg 


ggggtgagtc 


ctgagctcgg 


cacccctgag 


ctgtgtggct 


cacggcagca 


24300 


ttcattgtgt 


ggcttggccg 


cacccctttc 


cctgctgggc 


tgttgatgtt 


tagactggag 


24360 


cctctgtgtt 


cgcttccagg 


aaccaacccg 


tgtgcggaca 


ggaacggggg 


gtgcagccac 


24420 


ctgtgcttct 


tcacacccca 


cgcaacccgg 


tgtggctgcc 


ccatcggcct 


ggagctgctg 


24480 


agtgacatga 


agacctgcat 


cgtgcctgag 


gccttcttgg 


tcttcaccag 


cagagccgcc 


24540 


atccacagga 


tctccctcga 


gaccaataac 


aacgacgtgg 


ccatcccgct 


cacgggcgtc 


24600 


aaggaggcct 


cagccctgga 


ctttgatgtg 


tccaacaacc 


acatctactg 


gacagacgtc 


24660 


agcctgaagg 


tagcgtgggc 


cagaacgtgc 


acacaggcag 


cctttatggg 


aaaaccttgc 


24720 


ctctgttcct 


gcctcaaagg 


cttcagacac 


ttttcttaaa 


gcactatcgt 


atttattgta 


24780 
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acgcagttca 


agctaatcaa 


atatgagcaa 


gcctatttaa 


aaaaaaaaaa 


gatgattata 


24840 


atgagcaagt 


ccggtagaca 


cacataaggg 


cttttgtgaa 


atgcttgtgt 


gaatgtgaaa 


24900 


tatttgttgt 


ccgttgagct 


tgacttcaga 


caccccaccc 


actcccttgt 


cggtgcccgt 


24960 


ttgctcagca 


gactctttct 


tcatttatag 


tgcaaatgta 


aacatccagg 


acaaatacag 


25020 


gaagactttt 


tttttttttt 


tttgagacag 


agtcttactc 


tgttgcccag 


gctggagtac 


25080 


cgtagcgtga 


gctcagctca 


ctgcaacctc 


cgcctcccag 


gttcaagcga 


ttcttctgcc 


25140 


tcagcctcct 


gagtagctgg 


gactacagac 


atgcaccacc 


acacccagct 


aatttttttt 


25200 


atatttttag 


tagagacagg gtttcatcat 


gttggccagg 


ctggtcttga 


actcctgacc 


25260 


tcaggtgatc 


tgcccgcctc 


ggcctcccaa 


agtgctgaga 


taacaggtgt 


gagccaccgt 


25320 


tcccggcata 


ggaaaacttt 


ttgccttcta 


aagaagagtt 


tagcaaacta 


gtctgtgggc 


25380 


tggccttctg 


attctgtaaa 


gaaagtttga 


ttggtggctg ggtgcggtgg 


ctcacacctg 


25440 


taatcccatc 


actttgggag gccgacgtgg gcatatcacc 


tgatgtcggg 


acttcgagac 


25500 


cagcctcacc 


aacgtggaga 


aaccccgtct 


ctactaaaaa 


tacaaaaaaa 


aaattaaccg 


25560 


ggcatggcgg 


cgcctgcctg 


taatcgcagc 


tactcaggag gctgaagcag gagaattgct 


25620 


tgaacctggg 


aggcggaggt 


tgtggtgagc 


tgagatggca ccattgcact 


ccagcctggg 


25680 


caacaaaagt 


gaaactccgt 


ctcagaaaaa 


aaaaagtttg 


attggtgtaa 


ccaaagcgca 


25740 


tttgtttatg 


gattgtctgt 


ggcagctttt 


gttctgccga gatgagttgt 


gacagatctg 


25800 


tatgggctct 


aaagcctaaa 


acatgtgcca 


tccgcccctt 


tacagaaaaa gtgtgctgac 


25860 


ctctgttcta 


aagtattgga 


caactacaat 


gtttgctcat 


ttattattct 


atgatttgtt 


25920 


ttctgctttt 


tgttgttgtt 


gttgttgttg 


agatagggtt 


tccctctgtc 


actcaggctg 


25980 


gagtgcagtg 


gtgtaatctc 


agctcactgc 


agcctcgacc 


tcctgggctc 


tagtgatcct 


26040 


ctcatctcag 


cctccctagt 


agctgggact 


acaggcacac 


accaccactc 


ctggctgatt 


26100 


tttttttttt 


tttttttttt 


ttgtggagac 


agggtttccg catgttgccc 


aggctggttt 


26160 




ggctcaaaca 


cccacctcag 


cctcccaaag 


tgctgggatt 


acaggcgtga 


c o o r\ 
2o22 0 


gccaccatgc 


ccagcctatt 


ctactgtttg 


tattacatag ctttaaaaga 


ttttttatga 


26280 


ctttaagtca 


caagggttct 


ttgtagaaaa 


aaatatatat 


ataggaaagt 


ataaaaagaa 


26340 


agtaaaaatt 


gtccataacc 


tctccagcca 


gagacgaccg 


ttgctgacac 


ctcagcatat 


26400 
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tgcctttaag 


tcttttttct 


ctaagatagc 


atttctcttc 


atcacagtca 


tatgctacgc 


26460 


agaattctgt 


atcctgattt 


tttcacttga 


cattacaaca 


ggtatttgat 


ggcgctgtga 


26520 


caaactcttt 


ggcacaatct 


tttaaatgta 


tgaaatactc 


cactgcacag 


atgtttgctt 


26580 


ttaggcttaa 


ctgttctttt 


attttgcgtg 


tgctggttac 


agccgggcac 


agtggctcat 


26640 


gcctgtaatc 


acaacacttt 


gagagggtga 


ggcaggagga 


tcacttgagc 


ccagaagttt 


26700 


gagaccggcc 


tgggcaacat 


agtgagaccc 


catctctaca 


aaaaactttt 


ttaataagtc 


26760 


gggcgtagtg 


gtgcatagct 


gtagtcccag 


ccaccaagga 


ggctgagttg 


ggaggattgc 


26820 


ttgagcccca 


ggaggttgat 


gctgcagtga 


cctgagatta 


ctccactgta 


ctccaacctg 


26880 


agcgacagag 


caagacttgt 


ctggggaaaa 


aaaaaaaaaa 


aatatatata 


tatatatata 


26940 


tatatataca 


tatatacata 


cacgcacaca 


cacataatat 


aaaaatatat 


atttataaat 


2.7000 


atataatata 


taatataaaa 


atatatattt 


ataaataaaa 


tttataaatt 


atatttataa 


27060 


gtaaatatat 


aatatataat 


ataaaaatat 


atattatata 


atatataata 


aaatatataa 


27120 


tataaaaata 


tatatttata 


aataatatat 


aatacatact 


tataagtata 


tatttaaaat 


27180 


atatgtaatg 


tatatttttt 


aatgtatgat 


atataatata 


catttataaa 


tacacattta 


27240 


tattatttta 


tataaaatat 


atataaaatc 


tccaagttgc 


tttttccaaa 


aaggtgtctt 


27300 


gctgcatttc 


aaacattcat 


ttaaaaactt 


gaatgctggt 


gatctggtcc 


agaatgtgtt 


27360 


cagtagctgc 


tgccagtggc 


caagcatctc 


gggagatgtc 


tacaaaacac 


gctggttctg 


27420 


gcctggcgtg 


gtggctcacg 


cctgtaatct 


cagcactttg 


ggaggctgag 


gcaggtggat 


27480 


caactgaggt 


ctggatttcg 


agaccagcct 


tgccagcttg 


gtgaaacccc 


atctctacta 


27540 


agaatacaaa 


aaaattagcc 


aggcgtggtg 


gcatgtgcct 


gtaatcccac 


ctacttggga 


27600 


ggctaaggct 


ggagaatcgc 


ttgaacccag 


ggggcagagg 


ttgcagtgag 


ccgagatcgc 


27660 


accattgcac 


tccaggctgg 


gcaagaagag 


cgaaactccg 


tctcaaaaaa 


aaaaaaaaag 


27720 


atgctggttc 


ctaaaatgtg 


gcccttttcc 


tcctcacctg 


ctgccagacc 


atcagccgcg 


27780 


ccttcatgaa 


cgggagctcg 


gtggagcacg 


tggtggagtt 


tggccttgac 


taccccgagg 


27840 


gcatggccgt 


tgactggatg 


ggcaagaacc 


tctactgggc 


cgacactggg 


accaacagaa 


27900 


tcgaagtggc 


gcggctggac 


gggcagttcc 


ggcaagtcct 


cgtgtggagg 


gacttggaca 


27960 


acccgaggtc 


gctggccctg 


gatcccacca 


aggggtaagt 


gtttgcctgt 


cccgtgcgtc 


28020 
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cttgtgttca 


cctcgtatga 


gacagtgcgg 


gggtgccaac 


tgggcaaggt 


ggcaggctgt 


28080 


ccgtgtggcc 


ctcagtgatt 


agagctgtac 


tgatgtcatt 


agccttgatg 


gtggccagga 


28140 


ctggtagggc 


cctcagaggt 


catggagttc 


cttcgtggag 


cgggtgctga 


ggctgtatca 


28200 


ggcacagtgc 


tggctgcttt 


cacctgggcc 


gtctcaccga 


agtgtccatg 


gagcctgcgt 


28260 


agggtgggta 


tctgtgtcga 


ttttacagat 


gcagaaacag 


gctcagagaa 


accgagtgac 


28320 


ttccctaagg 


tcacataccc 


agttagagca 


gagctgggcc 


aggaagtgct 


gtctcaggct 


28380 


cctgaccagg tctccttgct 


ttgcactctt 


gccaaaacca 


tgatccagaa 


ctgactttga 


28440 


ggtccccgga 


cctcaggctc 


ctccgaaatg gcctcttgga 


ggctgctgag 


ccacagctta 


28500 


ggacccacct 


cgagaggcaa 


atgtgctttg 


agctgccagg 


cgtcctgggg 


gccctgcctt 


28560 


gggcacgggg 


ttcagacagg 


ccccagatgt 


gtggggcgtc 


tttctggact 


tgagttttct 


28620 


tttctgtgtg gtggacacag 


tgctcacccc 


ttaaagcacc 


tgtgatgtgt 


gcagcagccc 


28680 


aatccctgcc 


tgtcgcctgt 


tctgctaggg 


aaggaaggaa 


gacttcagga 


tggcaggaca 


28740 


acagaaagag 


gtccaggttt 


tagagcaagg 


gcaggtcaaa 


cttagaaaat 


tctggaatga 


28800 


ggatgtgcat 


ttcctcttct 


ggatctgcta 


aaagaagagg 


gaaggagggg 


ctgctggggg 


28860 


aggagcccag 


agccgagttt 


acatccggat 


cccgcaaggc 


ctcccctgcc 


ctgaggtctt 


28920 


gttttgtgat 


gfcgcttgtgt 


ccatcctggt 


ttctgccgtg 


tccccaacat 


ccggccaagc 


28980 


ttaggtggat 


gttccagcac 


acactcaccc 


tgtctgtgca 


cctgtttttg 


tgtccgtaag 


29040 


tgggtattta 


ctcaccttac 


gagtgagcca 


ctgtgggaat 


tcagggaggt 


ggcgcagtga 


29100 


ccacccctgg 


agggatatgt 


gtgtggcagg ggtcgagggt 


ctcgcccttc 


cctgcttcct 


29160 


gcgcgtggct 


ttctccagga 


cggggagggc 


tgagctgaag 


aggtggggac 


agttgcgtcc 


29220 


ccccgccacc 


cactgtcctg 


cggtgagagc 


agactcactg 


agcctgccct 


tctcccttgt 


29280 


gccttccagc 


tacatctact 


ggaccgagtg 


gggcggcaag 


ccgaggatcg 


tgcgggcctt 


29340 


catggacggg 


accaactgca 


tgacgctggt 


ggacaaggtg ggccgggcca 


acgacctcac 


29400 


cattgactac 


gctgaccagc 


gcctctactg 


gaccgacctg 


gacaccaaca 


tgatcgagtc 


29460 


gtccaacatg ctgggtgagg gccgggctgg ggccttctgg tcatggaggg 


cggggcagcc 


29520 


gggcgttggc 


cacctcccag 


cctcgccgca 


cgtaccctgt 


ggcctgcaag 


ttccccaacc 


29580 


tggcaggagc 


tgtggccaca 


cccacgactg 


cccagcagcc 


tcaccctctg 


ctgtgggagt 


29640 
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tgtccccgtc 


cacccctggg 


tgcctttgct 


gcagttatgt 


cgggagaggc 


tctggtgaca 


29700 


gctgtttcct 


gtgcacctgc 


tgggcactag 


gtcccagcta 


atccctgtgc 


caggactcta 


29760 


atttcaccct 


aacacacatg 


gtggttttca 


ttgctgggga 


agctgaggcc 


tgagcacatg 


29820 


acttgcctta 


ggtcacatag 


ctggtgagtt 


caggatcccc 


cagagatacc 


agggccagca 


29880 


ctcgatcccc 


acccagccct 


gaaccccacc 


atgtgctggg 


attgtgctgg 


gagtgtccac 


29940 


acgcctggga 


ccccagggct 


ggtgctctca 


tctccttttt 


ccagatcatg 


agaatgaggc 


30000 


tcagggaagt 


ttgaaaaaaa 


cctatcccaa 


gtcacacagc 


aacaggagca 


ggatttgaac 


30060 


ccagaaaagg 


ggaccgcaca 


ctctgttctg 


ctagagtagt 


tagctgtcct 


gggtgatatg 


30120 


gcaggtgaca 


ggggcaactg 


tgcttaacaa 


aggaaccccc 


atcccccctg 


ccaagttggg 


30180 


agactagaag 


gtcaggggca 


gaagctctga 


agggccaggt 


gcagtggctg 


acacctctaa 


30240 


tcccagcact 


ttgtgaggcc 


aaggcgggca 


gatgatttga 


gcccaggagt 


tcaagatcag 


30300 


cctgggtaat 


gtagtgagac 


gccatctcta 


caaaaaaatt 


ttttaaaaat 


tagctgggca 


30360 


tggtggttca 


tgcctgtagt 


ccaagctact 


tgggaggctc 


aggtgggagg 


attgcttgag 


30420 


cccaggaggt 


tgaggttgtg 


gtgagctgtg 


atcatgccac 


tgcactccag 


cctgggcaat 


30480 


agagtgagac 


cgtctccaaa 


aaaaaaaaaa 


gaagaagaaa 


aagaagctct 


gaggctccaa 


30540 


gtccccaggc 


accccttggc 


ttgagggcag 


acaagggagg 


agagggtcac 


ctgggcagcc 


3 0600 


ctgacttttg 


tcccctggca 


aagggacctt 


cagtgacctt 


ggccctagga 


gagcctctga 


30660 


gcacgtcagc 


catgtcgaac 


cgctcaggaa 


gggcagcaag 


aatttggctt 


ctgacctctg 


30720 


cctctcctac 


tcgccatctg 


cactgggtgt 


ggttgtgccc 


attttacaga 


tgaggaggct 


30780 


ggggcatcga 


ccagctgaat 


gccttgtecc 


aggtactgcg 


taggcagagc 


tggcagttga 


30840 


accccgtgtc 


ctggttgtcg 


ctgggggtgg 


gctgcaccct 


gacttgtgag 


gccagtagca 


30900 


aggtttgcac 


gtgacttcgt 


gaccgtcacc 


cagctctgca 


gcacatcccg 


tgacccagct 


30960 


catccaggcc 


gcatgcaaac 


ctgttgccag 


gcgagaaacc 


agtcaccgca 


cagctgtggt 


31020 


tgcctgaaat 


gattaagctc 


attaatcacc 


ccggagtgag 


gacagactca 


gatgaaaacc 


31080 


agcaaaagcc 


ctggaaactc 


atgtgaccct 


gccaatgagg 


gcggccatgt 


gcattgcagc 


31140 


ctggccgtca 


ctcctcggta 


cgtgttttgg 


acttaaacgc 


tccggatgtt 


tactgagtgc 


31200 


ttgattaata 


acatggaagg 


cctggtctca 


ttgctgtggg 


agtgaaggat 


gcacagccag 


31260 
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gcctgacatg atgagaacaa gaacctggag tctcgctgcc tgggtggtaa tcctggccct 
gccacttagc aactgtgtga ctgtagccag gtcacttaat tttgctagat cctgcctgcg 
cttcagtgga tcttgctggt tttccaaggt ggccaaacac tttaaggcat tcatgtggtc 
gctaggctgc agggttgaac cctggctcac cccgcagggc gccgtgtgct ctgtggcctg 
gctgtgcctt tgctgacacc gtgcccgtgt gtgttcatgc aggtcaggag cgggtcgtga 
ttgccgacga tctcccgcac ccgttcggtc tgacgcagta cagcgattat atctactgga 
cagactggaa tctgcacagc attgagcggg ccgacaagac tagcggccgg aaccgcaccc 
tcatccaggg ccacctggac ttcgtgatgg acatcctggt gttccactcc tcccgccagg 
atggcctcaa tgactgtatg cacaacaacg ggcagtgtgg gcagctgtgc cttgccatcc 
ccggcggcca ccgctgcggc tgcgcctcac actacaccct ggaccccagc agccgcaact 
gcagccgtaa gtgcctcatg gtcccccgca cctcactccc tcgttagatc aggctggttc 
tgggagctga cgctgaaagg agcttctcat ctggggttcc tgggtgtaca tagatggttg 
ggtaggttgt gcactgcaca agctgcatga tgctacctgg gggtccaggt ccaggctgga 
tggacttgtt gcttcatcag gacatagata aatggccaaa actcctcagc tggaaggtcc 
tgggcaggat ctttgggtgt gaaaaccagt cacaggggaa gggtgcttgc tcatactgcc 
agcacagtgc tgagtgcttt ccatagcgct cgtttactcc tcaagcctgg agggtgggga 
gtagcatggt cccatttcac gtacaaggaa cccgatgcac agagaggtgt ggcaacccat 
ccaaggccat acaactgggg tgggttgagc cggggttgac tgtggcaggc tggctcaaga 
gtccctgctc ctgaaccctt gccaggcagc ctggcatcag ctcggggaat ttttgccctg 
acccttggaa gcaagtgggc ctctttgttc tcatgtcagt gatgagaaga gtgactttcc 
tatggcccct ctggagtaca ggtgtttcct gttggcgggc tcttccccca tgacatcagc 
agcgagctgg ttatgattcc ctacgcagaa cttgatagtt tataaagctc tttgtcatcc 
aggccccgtt ggagtctcac gcagacctgg tcgcaggcgg ggctggtctt gcctgtccca 
gctgcatgga tggggaactt gaggcttgca aaggttaagg ggctgttcga ggcccaggct 
ggcaggagat gggcctgggc cagagtctgg gacttcccat gcctgggctg tctttggtcc 
tgttgctcac catccctccc tggggccatg accttagaga gccaaatgga ggtgcaggta 
acccacggca aggaggggtt gccatgactc agagtccccg tcctgtggcc ggcagtacct 
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ggtgcaacga cttggatttc agaccagcca ctgtagcccg ctgacggtgc 
ccacagcttc tgaagccagg caggactcag gccaggagac tctgttagct 
agaggccaac ggatgttctg gttctgctag agagctggtt cttcggatcc 
gcactgagag gaggcccagc ttgattctgg ggctgccttg tggtggcatg 
tgacaccctc gaggagtgtc ttctctcggg cttgttgact gtgcccggtt 
cactggtgca cacataggca catagcaaac cgcacacaca gtcgtgggta 
tacattccac caccagtgtt cactaccatt acctgccttc- cgtcttaagt 
taaaaataaa tttattgggc tggacgcggt ggctcatgac tgttatccca 
aggctgaggc gggcagatca cctgaggtca ggagttcaag accagcetgg . 
gaaactccat ctctactaaa aatacaaaat tagctgggca\t-.ggtggggca 
cccagctact caggaggctg aggcaggaga atggcgtgaa cccgagaggc 
gtgagcccag atagcaccac tgcagtccag cgtgggeaaq agtgcgagac 
aaaaaaaata aataaataaa agaaaaataa atttatgafcc. tatttcaaaa 
tactttgaaa cagcagagac acatatgaca cggagaatga aattccccat. 
caagagacag ccctggtccc cccgtctttc ccgtggacct: gcagcggggc 
ccgcctgttg tcgagtggcg tgctatcccg tcctcqagct . cctctgtggc 
ccacctgcag ccctgtcttt gcctcctcta gcgcccacca ccttcttgct 
aaatctgcca tcagtcggat gatcccggac gaccagcaca gcccggatct 
ctgcatggac tgaggaacgt caaagccatc gactatgacc cactggacaa 
tgggtggatg ggcgccagaa catcaagcga gccaaggacg acgggaccca 
ctgtgggaag ggtgcggggt gtgcttccca aggcgctcct cttgctggtt 
tgcccctgtc cttagcagag ggaggaaaca gaggatggct ctgggtgaat 
gcttcgatta tgtagtcaca gggtatgacc ctgagatgcg tggaaccccg 
tatatgtaga aactgggttt ccccgttgtt taagtagtca tggtggggtc 
ggacttttgt cttttcaaga aagaaaatgg tcgtgtgtca tgcaggggta 
gttaatccag gtttatcctt tattttgtgg gaactgtaca gtcatttctg 
gtatatgctc ttctgaaaga cacctatgca aaatcgcaca gtaaaaatga 



gctcgaagtg 
gttgagaggg 
tggtaccagt 
tgctgctcac 
ttccgcagtt 
tgagtfctcac 
gttcatcatt 
gcactttggg 
. ccaatatggt, 
t.gcctataat 
agagct;taca : 
tccatctGaa- 
ataacacatg 
agcgcaeccc-. 
agatgctgag. 
ttacagaeac 
gttcagccag 
catcctgccc 
gttcatctac 
ggcaggtgcc 
tccaggctgc 
gatgacttgg 
agactgtgat 
agaccccaca 
gttggtactg 
ctacaatgct 
cacaactcat 
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agggaaagcg gggccagggc acagccctca aaatctccat caatgacatg taagaaaaga 34560 

gaggaacctg ggaaatagca aagtgccttt tgcacattaa atggttagct atatcccaca 34620 

atactgtgca ttcgtaaacg ttaatgctgc aataaatacg gcacttcacc ttgggaagat 34 680 

ctggagttgg cttatgagtg tggaagggtg tagcgcatga gtttttgtga aacactggaa 34740 

ggaggattgt gggaaatcaa atggaaagtt ctcaccccag gcgtggagaa gagtgggtca 34800" 

tggccccagc agtgagccca gggaggtcag agacggaggt gtgtgtgtgg gtgtgaccct 34 860 

gcgcagttcc ctgcc'ggctg tagttttttg cattcgctta atgtttctcg tggaggaaat 34920 

tgtgcatgag caaatgtgaa accgtgctgt gctcaaattg tcctaataca tcattgcatt 34 980 

ggaacagatt ggcttttttt tttttttttt tttttttttt tttttgagat ggagtctcac 35040 

tctgtcacca gcctggagtg cagtggcatg atcttggctc actgcaacct ttgcctccta 35100 

tgttcaagtg attttcctgc ctcagcctcc tgagtaactg ggattacagg catgagccac 35160 
cgcggccggc cagatttgca tttttgaaac aactgctagg. ctgggcgcgg tggctcacac * 3522 0 

ctgtaatccc agcactgtgg gaggccgagg caggtggatc acctgaggtc aggggttcga 352 80 

gaccagcctg gccaacatgg tgaaaccccg tctctactga atatacaaaa atcagctggg 35340 

tgtggtggcg ggtgcctgta atcccagcta ctcaggaggc tgaggcagga gaattgcttg 35400 

aacccaggag gcagaggttg cggtgagccg agatcacacc attgcactcc agcctgggca 35460 

acaagagcaa aactccatct caaaaaataa aaaatagaaa aacaagtgct gtagcggaag 3 5520 

tgagcacttt gcggagtcag gcttgtgtgg cctgttccac aaatgatgtg ctcacggtgg 35580 

cctcaggccc acctggagtc tgcagcatgg ggcacaacag gttcattagt gtagaattcc 3 5640 

aggacaggcc tggctcctaa gcagccttct tttacaaaaa ctgcagagcc cgcctgtatc 35700 

ctagcacttt gggaggccga agtgggtgga tcacgaggtc aggagttcaa gaccagcctg 35760 

gccaacatgg tgaaacccca tctctactaa atatacgaaa attagctggg tgtggtggca 3 5 820 

cgcgcctgta gtcccagcta ctcgggaggc tgaggcagaa ttgcttgaac ctgggaggtg 35880 

gaggttgcag ggatctgaga ccatgtcatt gcactccagc ctgggcaaca gagcgagacg 35940 

ccatctcaaa aaaaaaaaac ctacagagcc acacggcctc tttctccacc gagtgttggt 3 6000 

gtgggagctt gtgttattgt ggtgaaatct tggtactttc ttgaggcaga gagaggctga 3 6060 

gcgcctggag agactttcac atgggtcgcc atgtccgccg tcggtttcgc tgttgtgctc 36120 
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cccatctgaa 


ggctggtgcc 


gtccagacag 


gctggacgcc 


cctttccacc 


agatccttcc 


36180 


tcccgcagca 


gtttctagtt 


acgttgtact 


gtgaggtctg 


tgtccttggt 


tgatggcaaa 


36240 


agtcagccga 


attgaaattc 


agagccatgc 


ctggctccct 


ggagcttctc 


tcctgggcag 


36300 


ctgtgatcat 


tgcctctgct 


gtggtgtggg 


tggtggaaat 


ggattccttt 


catcttgctt 


36360 


gctacaggtg 


actgtcacgt 


ggagtccttt 


ggagagaggg 


acgtgttaat 


tgatggatgt 


36420 


ggctcccatg 


ctgagaaagc 


tcctgggcgt 


acattgcctt 


agagtttcat 


tggagctgcg 


36480 


ttcttttatg 


gtgtctgcta 


ggcagaagtg 


atgaagactt 


ggaagaaaac 


ccagaaggtt 


36540 


ttccacttaa 


tttggaaaat 


gtgcttttcc 


cctcctgtgt 


cttttgctaa 


ggtccagcct 


36600 


cctgcagcct 


ccccgctctg 


tggactctgg 


ctttgattct 


ttattaggag 


tccccctgct 


36660 


cccccaaaag 


atggtgtcta 


aattatcatc 


caattggccg 


aggttttgtt 


ttctattaat 


.36720 


tgtttttatt 


ttttattgtg 


gtaaatttat 


ataacataaa 


atttgccatt 


ttaattgttt 


36780 


tgttattgtt 


gtttttgaga 


cagggtctca 


ccccagtgcc 


caggctggag 


tgcagtggtg 


36840 


cgatcatggc 


tcactgcagc 


ctcagcctcc 


agggctccag 


tgatcctctc 


acctcagcct 


36900 


ctctagtagc 


cgggactaca 


ggcafcacact 


accacatctg 


gctgattttt 


tgtatttttt 


36960 


ttttattgta 


gagacccgct 


atgttgccca 


ggctggtctc 


aactcctgga 


ctcaagccat 


37020 


cctcccacct 


caccctccca 


aagtgctggg 


attacaggca 


tgagccacaa 


cacccagcca 


37080 


ttttaatttf 


tttttttttt 


tttgagatgg 


agtctcactc 


tatcgcccag 


gctggagtgc 


37140 


agtggcgtgg 


tatcaactca 


ctgcaacctc 


tgcctcccag 


gttcaagcga 


ctctcctgcc 


37200 


tcagcctcct 


cccgagtagc 


tgggattaca 


ggtgcccatc 


actatgcctg 


gctaattttt 


37260 


gtatttttta 


gcagagacgg 


ggtttcacca 


tgttggccag 


gctggtcttg 


aactcctaac 


37320 


ctggtgatcc 


gcccgcctcg 


gcctcccaaa 


atgctgagat 


tacaggtgtg 


agccaccgtg 


37380 


cccggccttt 


ttttgttttt 


gagacagggt 


cttgccctgt 


cacccagact 


ggagtgcaat 


37440 


ggtgggctct 


tggctcactg 


cagcctccgc 


ctcccaggct 


caagttgtgc 


acctccacac 


37500 


ctggctaact 


gtattttatg 


tagagacaga 


tttcaccatg 


ttgcccaggc 


tgggcttgaa 


37560 


atggactcaa 


gcagtccacc 


cacctcagcc 


tcccaaagtg 


ctgagattac 


aggcgcgagc 


37620 


caccgcaccc 


agcccatttt 


acctattctg 


cagttgacag 


ttcagtggca 


ttcagtcagt 


37680 


tcacgaggta 


accatcactg 


ccattcatct 


ccagactact 


tcaccttctc 


ggcagatgtc 


37740 
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cgaaactgtc cgcattgaac acactcctca tctccctctg acagccacca ttctactttg 
tatctctctc tgccttctct aggtacctca tgtaagtgga attataccaa tatttgccct 
tgtgtgactg gcttctttca tgtgacatgg tgtcctcaag gttcatctgt gttatagcct 
gtgtcagaat ttccttcctt aaagcctgaa taataacccg ttgtaaaggc tgggcgcggt 
ggctcacacc ctctaatccc agcattttgg gagtccgagg tgggcagatc acttgaggtc 
aggagtttga gaccagcctg gccaacatag tgaaaccctg gctctactaa aagtacaaaa 
ttagctgggt gtggtggcgc gcacctgtaa tcccagttac tcaggaggct gaggcaggag 
aatcgcttgt acccgggagg cagaggttgc agtgaaccaa gattgtgcct ctgcagtcca 
gcctgggtaa cagagtgaga cttcctgtct caaaaaaaaa aaaaatcatc ggatggatgg 
acggaccact tcttgttatt tatccatcca cgggtgctag gtttcttcca cctttggttg 
tcgtgaataa ggccactatg aacatttcct tccgtggtga aggttttgta ctagtgagga 
aaaggcgtgt ttgtggtgtt gcataggatt ctggtaagaa agtttgcact aaccataagt 
atttgtacta cattaaaatg aaagctcagg ggccgggcgc ggtggctcac gcctgtaatc 
ccagcacttt gggaggccag ggcgggcgga tcatgaggtc aggagatcaa gaccatcctg 
gccaacatgg tgaaaccccg tctctactaa aaataccaaa aaactagcca ggtgtggtgg 
cgggcacctg tagtcccagc tacttgggag gctgaggcag gagaatggcg tgaacccggg 
aggcggagct tgcggtgagc cgagatcgct tcactgcact cgagcctggg caacagagca 
agactccgtc tcacgcaaaa ctctgtctca cgcaagactc cgtctcaaaa aaaaaaagag 
ttcagggttt atgaaactgg ccagccgcgt aaagtttgct gtgttgtttt tgtgcccggg 
aggagtgtgg ccagggtgtc acgtcacaca gtacacgttt ctcagatggt ggttctccag 
actgctgtcc caaagtctgt ttttgcatct ggttcccaca gacccaccct ccacggtgag 
cctgattttg gccagggtag ctggaatctt gcttgtcttt cagcccggca gctgtaccag 
tccagggtcc acagctagtg gcttttagga aggaatttgt tcagttggct ttgacacatg 
gccccctagg gtccacagct ctgtagtgat gtggatgttg ttatctacaa agacacatga 
tccttcgtgt ccagatgaaa gtgatgatgt ctttgcagct gcccagcaag gctgtgtgtg 
tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tggtgtgtgt gtggtgtgtg tgtgtgtatg 
ggggagggag gcaccctttc catctggggg tgtgtgtgtg tggggtgtgt gtgtgtgtgt 
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gcgcgtgtgt 


gtggtgtgtg 


gcgugugngc 


gtgtatgggg 


gaggcaccct 


ttccatctgg 


39420 


gtccaagaga 


ctgggcctgg 


ggaagacgct 
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tctttttatc 


tacttagaga 


ctttgtttta 


39480 


tttgtatttt 


tttgagacag 


yycCtCaCtC 


tgtcacccag 


gctggggtat 


ggtgatatga 


39540 


gcatagctca 


ctgcagcctc 


ggcctcccag 


gctgaagcga 


tcctcccacc 


tcagccttct 


39600 


gaatagctgg 


gactgtaggc 


gcgcgtcacc 


at act gage t 
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ccagtgccca ggggtctccg ccagtgctca ggagtcttgg tttctttgtc ttacagccct 
ttgttttgac ctctctgagc caaggccaaa acccagacag gcagccccac gacctcagca 
tcgacatcta cagccggaca ctgttctgga cgtgcgaggc caccaatacc atcaacgtcc 
acaggctgag cggggaagcc atgggggtgg tgctgcgtgg ggaccgcgac aagcccaggg 
ccatcgtcgt caacgcggag cgagggtagg aggccaacgg gtgggtgggg gtgctgcccg 
tccaggcgtg cccgccgtgt cttatgccga atgccagcct ctcacaggct ggggagactt 
tccacctggg gatccaatgg gtggctttcc agggtcccaa aagcaaacac aggtttttca 
cagcccgtcc gggaaagcag aaagccccaa ggggctggaa ggggaaaggg ggagctctgc 
tgagaggtta caaggcagcg ctggccgacg ggagttgcag ttgataggtt ttgtatcatc 
cttgttaaac ttgaaccctg tgcagaaatc ccttccacgg catgggggct gcctgttgac 
tcgctcctgt tccaccacag ggagctcctg ggcttcttcc tcccagaggc ccccgacgct 
cccacctgtt ggtcgtcaga gcttctggtt ggtgggaagg cacccaggac cttgaggtct 
ccagagagaa aagccaggga aagagggaga ccgaaaccca tgtgacatga aactcaggct 
ccaaactgag cacgggaacg tttggggaca ggagcgcgat ggccttcctc agatagctgg 
ggggctggca tgaagacggg agctacagcc agcacaggtc ctgggccggg agcccagaga 
ttgagccctg actctgtcac ttactggcca cgtgaccttg ggcgggtggc atagcctctt 
ggagactcag tttcctcatt ggtaggagtg acggccacag tggtgcggcc tctgcagcac 
acggggggct cggtgggcgg aagccccggg tctataaggc ggctgtgcag gagccagccg 
agctggtctc ccaacagcca gggctccggg gtccttagca gctgtggggg gcctgcacct 
gtttcccatg gctgctgtca gaaattacca gaagccaggt ggctgagagt aatggacact 
tgttctctca cagttcctga gggctgaagc ccgagatcga ggtgtgggca gggccctgcg 
ccctctgaag gctctgaggg aacctttggg cttctggtgg ctccaggcac cccttgactt 
gtggtcctgt cactccagtc tctctgtctg gctgcacatg gcgtggcctc ttctgtacca 
ttgaaggaca cttcagttgg atttagggcc taccctcacc cattgtggtc gtatcttgat 
ccttcatgac atttgtaaag accctgcttc caaataagct cacattctga ggttctgggg 
tgagcgggaa tttggagagc attgttcaac tagtatagaa tgtgacctgt cagcctcggg 
cagccctgag aggcaggggc tttccacagc ccagctgggt gccctgggct ccgtgctgtc 
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tgtggcctcc caaagtgcta ggattatagg tgtgagccac tggcctgaca acacccacgg 
attgtctctc agttctgtaa ggcaaagtcc aggcacagcg tggctcacct gggttctctg 
ctcagggtct cacggggcca gaatcaaggt gtcaggaacg ctgggccctc agcggaggct 
ctgtggagaa attagcttcc ttgctcactc agcaggtagc agttgtggga tcgaggttct 
gttttctctc tggttattgg tcggggacca ctctcagctc ctagaggcca ccacaggtcc 
ttgccccgtg gccctctctg cctcagcagt gggggctccc tgcgtcagtc cctcccacac 
cttgagtctc tctgatttgc ttctaaaggg ccctgtgatt cggctcagcc acctttagat 
taggttagcc tcccctttga tagactccaa gtcggctgat taataacctt aatcacatct 
gcagaatccc ttctgccaca taaggtcatg acgccgtgct ggggactggg gtgggaaatt 
acggggtcat ttaggattct gcctgccact gccttgctgt gtcccagggc ttgggggagg 
ggcctccaca gctgggacca cagtccttcc tcccctccat ggtaaccatc tgaggattac 
ttgagaccag cctgggcaac atggtgagaa cccatcccta caaaaaatac aaacaaaaag 
ggaccaggct gggcttggtg gctcatgcct ataatcccag cactttggga gaccaaggtg 
ggctgatcac ttgaggttgg gagttcgaga ccagcctgcc caacatagtg aaatcccgtc 
tctactaaaa atacaaaaat tagctgggtg tggtggcagg cgcctgtatt cccagctact 
ggggaggctg aggtgggaga attacttgaa cctgggaggc ggaagttgca gtgagccaaa 
attacgccac tgcactccag cctaggcaat agagtgagac tccgtctcaa aaaaaaaaaa 
gggccagggg tggtagtgac aaagagaccc tatcccaaaa aaaccgaaca ctgaatcctt 
gagactgagt aaggacactg tgaaattttt ctgggtgggg cagggaacag agcgtcttct 
gtcatttctt ccacctgggt gtggtcagct ctccctccaa gctgcctcct cttcttctca 
ttgtccgggt gttggacaca tttggttaac tggatagaat aacgcgagtt cccagggact 
tggtccattt gctattttat tttattttat tttattttat tttatttatt tatttattta 
tttatttatt tattgagatg gagtttcgtt tttgtcgccc aggctggagt gcagtggcgc 
gatctcggtt cactgcaacc tctgcctccc aggttcaagt gattctccta cctcagcctt 
ccaagtaact gggattacag gcacccacca ccataccagg ctaatttttt tgtattttta 
gtagagacgg gttttcgcca ttttgcccag gctggtcttc aactcctagc ctcaggtgat 
ccacgcacct cggcctccca aagtgctggg attacaggca tgagccacca cgcctggcac 
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catttgctat tttaattccc atgtgtatta gtgtcccacg gctgctgtaa caaatgacca 
caaactggat ggcttaaagc aacagaaatg gattccccca atgtgctgga gaccagaagc 
ctgcgaccaa actgttggga gggctgtgct tcctctgggg gctccaggga ggatctattt 
gttggccctt ccagtgctgt gggtgccagc gttccacact tgtggatgcg ccgcctcaac 
ctctgcccat cttcatgtgt ccatctcctt tgtgtctgcg tctttacctc ttcttcttgt 
ctgtgttgcc tcttataagg acgtttgtca ttgggtttag ggcccaccca aatcatccga 
gatgacctcg tcttgagatc cttaacctgc aaagaccctt tttccaaaaa aaggttatgc 
tcacagattc taggccttaa gacatgggtg tatctttctg gggggcacta tccaacccct 
tatacaatga aagacgggaa gagggccagg tgtggtagtt cacgcctgta atctcagcac 
tttaggaagc tgaagcggga ggatcacttg agcccaggag tttacaagta gctaggcaac 
atgatgagac cccatttcta caaaaagtga aaaaaaaaaa aaaaaaaaaa aagccaggtg 
tggtggctca cacctgtaat cccagcactt tgggaggctg aggcaggcag atcacgaggt 
caggagattg agaccatcct ggctaacacg gtgaaacccc gtctctacta aaaatacaaa 
aaattatggc cgggcgcagt ggctcccgcc tgtaatccca gcactttggg aggccgaggt 
gggtgaatta caaggtcaag agatcgagac catcttggct aacacggtga aaccccatca 
agatcacaag gtcaagagat ggagaccatc ctggctaaca cggtgaaacc ccgtctctac 
taaaaataca aaaaattagc cgggcatggt agcgggcgcc tgtagtccca gctgctcggg 
aggctgaggc aggagaatgg cgtgaacccg ggaggcggag cttgcggtga gccgagatcg 
ctccatgcca ctgcactcca gcctgggtga cagagtgaga ctccgtctca aaaaaaaaaa 
aaaaaaaaaa aaaaaaagaa aattagccag gcacagtggc aggtgcctat tgtcccagct 
acttgggagg ctaaggcagg agaatggcat gaacccggga ggtggagttt gcagtgagcc 
gagatcatgc cactgcgctc cagcctgggc gatagagcaa gactctgtct caaaaaaaaa 
agccaggcat ggtggtgcat gcctgtagtc ccagctactc aagaggctga ggcaggaggg 
ttgttcgacc cacggagatc aaggctacag tgagccatga tcgcaccact gccctccagc 
ctgggtgaca gagtgtgacc ctgtctcaaa gtaagtaaat aggaggagag acaagtgggc 
agttcagact gatggtatgg gcacagtaga gactggtgca gacaggctgg cctgtgatgt 
caagcaactt ctgtaattgt ttccggcatc catttgtgtg tcaatttccg tgtcagtagg 
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aagactctgt aggctgccaa gaggaataag tgggaggatc ctcccagaga ggccgggcct 
gcaggagggc cagttctcat gagttctcat ttggccccta ccctccaggc tgtggttctg 
aggtgggaga cagagcctga cctctgtttg tcttgttttg tctttgcagc agcccaccca 
tgtgcccgtg acaatggtgg ctgctcccac atctgtattg ccaagggtga tgggacacca 
cggtgctcat gcccagtcca cctcgtgctc ctgcagaacc tgctgacctg tggaggtagg 
tgtgacctag gtgctccttt ggggtgatgg acaggtacct gattctctgc ctgctaggct 
gctgcctggc atccttttaa aatcacagtc cctgtggcat ccagtttcca aagctgattg 
tgtcttcctt tgccctcctt tcttttctac tatgtgcatt cggtgctatg aattttcctc 
taagtactgc gtttcctgca tctcacaaat tttgttacat tttcattttc aggtagtttg 
aatattttta cacttctcct gagatgacat ctttggctca tgtgttattt agaagtgttg 
cttagtttct aaagagttgg ggcttttcca gctgtctctc tgcaactgat ttctaattta 
attctactgt agtctgagag cttattttat atgatttctg ttattttaaa tgtgttgggt 
gtggtgtttt tgttgttatt gtttttgtgt ctttttgttt tgttttgctt cgtttgtttt 
gtttttgaga cagtgtcttg ctctgtcact caggctggag tgcaatggcg cgatctcagc 
tcaccgcaac ctctgcctcc cgggttcaag tgatcctctt gcctcagcct cctgagtagc 
tgggattaca ggtgcacgcc accataccca gctaattttt gtatttttag tagagacggg 
gtttcaccat gttggtcagg ctggtctcga actcctgacc tcgtgatccg cccacctcgg 
cctcccaaag tgctgggatt ataggcgtga gccactgtgc ctggccatta ggtgtgtttt 
atcacccagc atcatgcagt ttatcttggt gaatgttctg tgtactcttg aaaagaatgt 
ggattctgct gttgttgggt ggagtgttcc agaaacatca attagatcca gttggttaat 
agtgctcatc aggttgtctc tatccttcct tcctgactgc ctgcttgagc tgtcagttat 
tgacaggggt gtggagtctc caactctaat ggtggatttg tttatttctc ctagtagttc 
tatctttttc tctccttcta cccttgatcc tcttctcccc ctagggcttc ctggtgttag 
tggtgggaga gtggggtagt gaagaacctg gactttaggg ccaaagaggc cagggttcaa 
atcctggctc tgtcacttcc cagttgagtg accctggctg gtgcctgaat ctctgtgagc 
ctccacttcc tcctctgtga aattgagagc acttacctgg caggctgtca tgggcatcaa 
gtaacagggc actccacctg gaccctgaca cgtgatgcac aggaatgcca gctgctatgc 
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agtgaggcct gggccgattt cccagcatcc cctcctgagg ccagcctctg tttcctgtga 
ttttctgcac aaagtgggag ggaggagtcc taggaaafcgg ggggccacct cgaagcctag 
gcctcctctg gcttctctgt gccagtgccc ccacgctttg tgtctgtgtc cccagcccat 
gggactctgc tattccctga gtgctgccgc atgcccagcc cgcactgagg acgtggagcc 
ccgaggggca ggatggcctc catggtcaca cgtaggaagt ggcctccacc ctccgatgat 
cctctccctc ctccctttca gcgccctccc cgggggtgtc ctcagccctc ctgcctgtgc 
tttgtcccgt cttctgcagg cgcctgggac gtgctgacag gtcctctgcc ggctcctgcc 
ttgctatgcg cacgctggtc accacagagg cctggccctt cttctgtagc agtcccacac 
ccgcaacagg tgtggctgct gaccacctgc tttctgcccc tctggtcctg aggagggcgc 
agtgggcact caggcgtggc tgagcagatg tgtgttgccg ggaggaggaa ggactgctcc 
agtcagggct gaatttccca cccggagcat ttctgctgta tttggtgtag cgcctgctgc 
ttaaagctct gattcccagt tggcaccctt tcccttctgc attgaaaaac atacggatgc 
atgtcttctt gcagtgaatg tgtattctcc cagcctctct tctgggttgg ggctggaggt 
ggagcggcac acaggagccg cagcgatgga ggatgtgcgg gtgcagcacc ccgtacagca 
gggatgccaa acccgcgctg agtccctctc aacttctgct ttgaagccca gtcacgccat 
tgcctgggtt ttgctgggcg gggctgcgtg tgatgttctc ctctgtccct cccccagagc 
cgcccacctg ctccccggac cagtttgcat gtgccacagg ggagatcgac tgtatccccg 
gggcctggcg ctgtgacggc tttcccgagt gcgatgacca gagcgacgag gagggctgcc 
ccgtgtgctc cgccgcccag ttcccctgcg cgcggggtca gtgtgtggac ctgcgcctgc 
gctgcgacgg cgaggcagac tgtcaggacc gctcagacga ggcggactgt gacggtgagg 
ccctccccgt caaggctctg ccaagaccct ggccctgccc tccgggatac gagcttgggg 
ctgcctccgg cctcacagga gtaggggctc tgaaaacctt tgcttgcagg gagattgcca 
agtctgtctt ttaggcccaa caaggaaaac tctgcagttc cacccatcct gtcccaccag 
gtagtgtggc ttgaaggcag actgtgaggg tctatctcac cttcctgcat taggtcagga 
gtttcacaga aacctgaggc acattcaggg gtgggctgca gaggtccatg gctcacaccc 
tggaaaatcc gcccccaaaa gacagtgctg tctccactga ccagtctgtg ggatagtgct 
taagcctgag tggtttctat caacatgtag aatcaggagg tataaagaga tttgctcagg 
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ctaatttttg tattttcaat agagatgggg tttcaccatg ttggccaggc tggtctcaaa 
ctcctggcct caagtgatcc acctacctcg gcctcccaaa gtgccatgat tacagatgtg 
agtcactgcg cccagtgagg gtcacatttc cgttgagatt tggaggggca gacgttggag 
ccatctgagc cccctcgtcc cgctctagct tctcctcccg tgtgccccgc ggtgctggtg 
gcaggccctt acgccggttc tggctgcatg ctctgttcca gaagctttct tccctgcttg 
gttaccagaa aatcatccca tccattacaa ggacagggtc cccttatctc ccattcccag 
ggcaggacac cgggggcagg gcaggtgggg aactgagcaa gttctctggg ggcaggcgtg 
gctatggctc cctctgggtg ggcgtctggg gaggggtgga ggcagccgtc agcgccctgg 
cttgctcttc ctccctggcc agagactgtg gccttgtgct gctcccgtgt gggctgcctg 
cacctccagt gggttgtgct ccctcccctc ccctcccctc aagctctgct gagcaccact 
gccttccaca gcccccactc tcgggaggcg aggctcctcg tggccattcc tgtccttggc 
acccaccccc ccaccaacct ggtagagcct tgggcggggt ctgttactcc ttgcatggcg 
tagacctccc cacagtaggc acctgacaca tacctcctgg ggggcaggca ggaggtgcgt 
tgaggtctca gccctggcag tccctcccct gcgtggcata ggcctcgcca cagggtcatc 
gagggtgggt ggagactgta ctagaccact ccccgctggt cctagaaagg gtcccatctg 
tctgctctct gtttggagtc cagaccttgg ttgctgtgcc ctgcatggtg ggctgggggg 
caccctccag cctctctgag tgcatggcct ctccttgcag ccatctgcct gcccaaccag 
ttccggtgtg cgagcggcca gtgtgtcctc atcaaacagc agtgcgactc cttccccgac 
tgtatcgacg gctccgacga gctcatgtgt ggtgagccag cttctggcac ggggaagggg 
cgtccgggct gggttccccc aggaacgtgg agtttagggg aggagacgtg cctttccagc 
ggggctgggg gctgtgtggg agactcaggc ggctgggagg ctccttgcgg gaggcaggga 
agcctttccc agggcagcgg ccaggaggac agactgtgag ctgtgggctc ggcggctaca 
gagtctgcct cagtgggcgg ggctgatggt gtccaggtgc ctgcagcacg cacccaccca 
cgggaccttg ctgagcagcg tctgtcaggc agcaagatta cccgagggct gcagtggtcc 
tgttccctgg cagcttactg tctggctgag gaggagtgat gttcacatat gcacacatgt 
catgtgcaca cacatgtaca tgacaacatc ccacatgctc ctcaaatagc atgacctgta 
cagtcacgga tatagggcct aggggatagg aggccaagac agtcagggaa gactttccag 



54000 

54060 

54120 

54180 

54240 

54300 

54360 

54420 

54480 

54540 

54600 

54660 

54720 

54780 

54840 

54900 

54960 

55020 

55080 

55140 

55200 

55260 

55320 

55380 

55440 

55500 

55560 



BNSDOCID: <WO. 



0177327A1_I_> 




WO 01/77327 PCT/US00/16951 

185 



aggcagtggc 


4- /— • 4- /— » a a -a ri ft 
Lyddayy 




caggcaggda 


ptptpt^? PTP 1 1~ Prapr 

yyy^-y <-gay 


4— 4— -a /~r a 4* a ptpt 

tLcagatayy 


J J O <i u 


— /— w 4** AT A"* f*5 ^ ^ 


#-» -a /"t+" p» a 4~ 4* /*t4"" 

yay tuaL eye 


<*"r*r o+'oroTOrOTa 0 

gLCtygygac 


acggccaccc 




9 a yyy accc y 


jjOOU 


ptptp* 4* /*f a rrarfp 


l— ^_ O LLULLUd 


L.yyv_L»y^cty «_ 


pnnrra rta a 


PT+~ P*t~ PTt" t~ PIPTPT 

y i-c uy u u yyy 


prrrpf a pt a a ptptpt 

gggagaaggg 


O ~> / *± u 




CLVJ ^ ^ CX w I— V-» w w 


i«y w ^ ^ v» l» l> 




t~ pi p* p* a crcr t~ p»a 
uy w cty y_ v» v» 


ptpi rh t~ pra ptptp 1 
yyL>L. l, y ciyy 


55800 


r-» 4- da a ptptp* a pt 
uy dcty y ^.cty 


4- pfprt - pt pt pictpi p* 

u yy u yyyyy ^* 




l»-»rTr-«f~»4 _ r-«r~ , +" O t~ 


np +- PIPTPT paarr 


t - f~p»p"p'aprt~ p"t* 
t_ 1_ 0 cty l.^-. L. 


C C Q£Z(\ 
J J ODU 


ydCyyyccty 


ugecyuggge 


cccoyctyty 


rrrfrr/TPrtP 4- <^r+- 

yyaycyc^y 1 - 


•4- ««f3 4- rrprtp a PT 

Ly acy cy cay 


codyyectuy 


D 3 J7<6 U 


ccgccagagc 


ccgcacgcfc t 


CCdtCCCyCL 


gacttcatcg 


acgccctcag 


gatcgctggg 


r r n □ n 


ccyycccuyu 




aty uyy c u c u 


tyceaaay ll 


gagee uggag 


/^» p* 4" rrrra a a 0 4~ 

cctyyaaacL 


DOU*tU 


CCCCLaCyyy 


cayccLtyat. 


ay c sry a gcgg 


cccaaggagc 


CCdCCCdyCC 


yaccccyccc 


jQiUU 


CCCCCyLyyC 


rggrgggegg 


caccaggggc 


tgccugyctL 


4— *»>r*"i 4— /t 4t" 4- ^ a 
tyCUCyLCCd 


y« y-i a a 0 a +• 0 a o* 
COddtdCCdC 




ctgggctggc 


cagggcgcgc 


tCdCutCCyC 


caccaccgag 


ggece ugggc 


gaaggaguga 


C COT ft 
O D Z Z U 


ataccaggct 


gccttggcag 


ggatgtgttg 


a 999 c tgt:gg 


ggagfc eggae 


a 9 c 99 c 9999; 


jQZOU 


gtcagaggag 


gaggagggtg 


caccgtgcag 


gctgaagggc 


cacgttaccc 


i-gagguuggc 


Aft 


caggc ucccc 


a.g gec tagc c 


^ n ^ ^ 4" /"» #^ 

tcccayct.ee 


/«t M M nM4f-f ^— ■ 4*" 

cccactLtcu 


ccccaccctc 


caccayuyyc 


^ ft ft 


daagccaycc 


j*i 4-* --j /"t^t 

ccLLcagyyo 


ycdcyyuy lc 


tycccccaag 


gagggcccac 


tucguugggg 


c /r a /r ft 


uudaLyLLyy 


/~1 ,~» M S-< 4~ y— ( -l— 4- 

ccaccccttu 


o+- ot+- ffrrt-rf 
CtyLtuy uCl 


c tggcagaaa 


ccaccaagcc 


ycccucaydc 


c; ft 


gacagcccgg 


cccacagcag 


cgccaucggg 


cccy LCdLcg 


gcaccatccu 


y-i 4- r-« 4- /-» t** ^» 4" +~ y^i 


^ K P ft 


gt cat gggtg 




ugtgugccag 


cgcguggugu 


gccagcgc t.a 


wgegggggee 


iCiC A ft 


aacgggccct 


ccccgcacga 


ytatgucagc 


gggaccccgc 


acgcgcccct. 


f% t\ — i ^ 4— 4— y« a 4~ a 

CddCttCdtd 


I^^7ft ft 


gccccgggcg 


y Lucccdycd 


4- f~* t~* r~* r"* \~ f~ 

Lyy LLLL L. 1_ 


a a /"T^r* - a a rrrr 

dLdyy Lddyy 


ageccgagau 


a 4" rrrra a 4" na 4~* 

atygaatydL 


JO / o\> 


c cggaggagg 


C dyyayay t. d 


ort" o > f"/Tororo»aor 

gcccgggcdg 


ecteggggag 


4- f OTP* a i-TPTPT 

c ggagc aggg 


a 4" /^ , 4 — ^t/~» 4~ a p»p^ 
cl Ly tyC L.clv* u 


croon 


ccagycccuc 


LtyLdLdty L 


r~m/~> anap -a 4- 4- 




p» a 0 1 a r~r p< afhr 
C a C dy LdLLL 


aprr^p«4-4-4- p»p»p» 


C£ Q Qft 
JOOOU 


actgagcctg 


cgc uuggcac 


cagaac.ee uu 


caacacagag 


gcccgcacgg 


ccgcagcaac 


c; C Q A ft 


ccaccctttg 


gcactgtagg 


tgtggagaaa 


gctccttgga 


cttgaccttc 


atattctagt 


57000 


aggacatgtg 


ctgtgttgtc 


cacaaatcct 


catgtaccct 


agaaatgaat 


gtgggggcgg 


57060 


ctgggctctc 


tccagagctg 


aaggaatcac 


tctgtaccat 


acagcagctt 


tgtcttgagt 


57120 


gcagctggga 


tttgtggctg 


agcagttaca 


attcctacgt 


ggcccaggca 


ccaggaacgc 


57180 



BNSDOCID: <WO 0177327A1J_> 




WO 01/77327 PCT7US00/16951 

186 



aggctgtgtt 


tgtagatggc tgggcagccg caccgcagag ctgcaccatg ctggtttgta 




tcacatgggt 


gaccatggta tgtctaagaa qcrtQcraQtcc ctataaaatc t-araaat-cmn 


O / O u u 


cccacagctc 


caggccacct tgaggattqc ctctcrcctcrc ccaaccctaa aft-rrrt-rhr 




ccctgtcctg 


tcccactgtc accccaagcc ggcctcattg ggagcctgtt ggatggcagg 


1 A*) f\ 
D / U 


gtatagatgt 


aacctgattc tctctgggga gcggggttat ctggcttctc aagagctcct 


3 / ft O U 


aggagcccac 


agtggtggca ccatcacagt cgcagcagcc cccagagaac gcggccctgt 




ctgttcctgg cgtgctctgt gctgccccgc ctgggttccc tgccccagtc gcaggcccct 


O 1 O U U 


tggaggaggt 


accatgtgtc tcccgtttca cagatgagcc ccggggagct cactctagta 


3/DOO 


gtggccagag 


aggcctgcgg ctcagggagc ggggcacatt tccaacagga cacaccgccc 


3 / / A U 


tggtctgagt 


ctcgtgggta gtgggagcag aggagagcgc cctatgtctg tggggcggct 


3 / / a u 


tggctgagcc 


tggaagccac ctgacctccc ccgtcccttc cctgccaggc atcgcatgcg 


3 / 0*fc U 


gaaagtccat 


gatgagctcc gtgagcctga tggggggccg gggcggggtg cccctctacg 


3 /SOU 


accggaacca 


cgtcacaggg gcctcgtcca gcagctcgtc cagcacgaag gccacgctgt 


57960 


acccgccggt 


gaggggcggg gccggggagg ggcggggcgg gatggggctg tgggcccctc 


58020 


ccaccgtcag 


tgctggccac cggaggcttc ccgggttcct gggggctgtg ccaccgcctc 


CortDA 


tgaggcatgc 


ttgctttctt cccttttcaa acccttctgc ttccttcttt aatgacattg 


Dol4 0 


ttgattgtgg ataatctgaa aactacacaa aaatataaag agccaaaatc tcacccaaat 




ccacctccta 


gagtggctgt tgggctccgt cagcatccag gcggccgtct gtgttccgca 


JO^uU 


cggcccagcc 


catcgatagc cgcctgcacc aggcctgtct gccctctgtg agcctcccca 




cagggttccc 


tccacaaaca ccctgttctc ccacccaggg ctggctgctt cctggaaaac 


c Q o o n 


agctggatgg 


ttttgtgcat gacagacaaa cacagggtga ttttcgtggc taaaatactc 


C Q /I /t A 


cctggagctt 


ttggcagggt gaggggctgg ctccagctga gccacgcctt gagtgaaatg 


c o c n n 
3o3 UU 


actgtgagga 


gaataaactg ccgctgccct ccaggatcac tggggctggc tggggagaac 


3 C 3 O U 


ccccgtttct 


gggagcacag tcccaggatg ccaaggcgag cttggtgccg agatgtgaac 


58620 


tcctgagtgt aaacagcggg ggctgacttg acatgctttg tatgcttttc atttgttcct 


58680 


gcagctgtat 


gcccctaagg tgagtccagc ccccttctgc ttcctctggg gcctcgccag 


58740 


tgagccccac 


cttgctgggg ctggttcctc ctgcccttct gggtatccct cacatctggg 


58800 



BNSDOCID: <WO 0177327A1J_> 




WO 01/77327 PCT/US00/16951 

187 



gtcttgtctt 


cttgt tttc t 


tt ttcttttt 


tttttcracra c 

w w- w- • w y-j u. ciw 


crcr a cr fc fc. fc c* a c 

w| w wL^j U U U L CL U» 


fc fc fc fc crfc fc ctp n 
l u u uy u uy l u 


JOODU 


caQcrct tcacr 


tQcaatcrQtcr 


tga fc etc tag 


gctcaccgca 


acctctacct 


ccc^acrcrfcfcca 

w v— a wj y u u w. ci 


58920 


agcagfc tccc 


c tercet caae 


c tccc tagfca 


actcrcrcratta 

yw ^yyyaL l. cl 


c a cr cr c a fc cr fc cr 

33 3 Zj 


w w CLU CL wy U. w 


coqqn 


caact*aattt 

w ucm u u u 


tatattttta 

Ly l l u u u O- 


ot" acracratcTCf 

y L- c*.^^ ui y cl y y 


crcrt t t"ct" r*ca 

yywL.UwL.wwd 


fcnfc fccrcffc ran 
uy u uy y ULdy 


crpfcciafcpfcfcp 
yLLydLLL Ly 


c; on a n 


ClCltw U LL L L a L 


pfc pa pert - t* 
w w w cty y y d l 


rrnr rpappl* 

w L> w w CIL- w V— 


L.yywwwwwwd 


aay uy l uy y y 


d l LdLdy y lq 




haaorrarra 

u y y u.l^ ww 


rarrt* ctptp c fc 

w CL w w U '—4 M w w w 


ttttcttttc 

l_ Lj>L>wwL>L»L»L> w 


fc fc fc t" r* t* 1 1* t" r* 

UUUUU.UUUUU. 


fc fc fc fc fc fc r> fc rra 
U L U U U u l Ly d 


pa pappp;f* pt~ 
y acdy yy lcl 




cactctotca 

w« W U l»H L~ w CL 


w w l dy y u Ly y 


acitcrcaatcfcj 

cl^j ^y ^* CLCi wy y 


uy UwdUwduy 


yL uaaL uyLd 


pp p fc /""•+* a o r~» 
y L L LL U dL L L. 




fcc tacjcrefcca 

W W W ^* J w w \+ < 


aacaatcctc 

t-L ~n w wl CL W W W w 


ccatctcaac 


ccctaacrfacr 

wu>w ucLCLy ucy 


f fc a crcra c fc crc 
w uay y aL uy l 


r3 y-'r-r/-' a fc CTP a fc 
a u.y l a uy l a u 


C Q ft 
-J J^OU 


w U V>UClL.y www 


a err 1 fc a a fatt* 

ay L UCld UA L U 


tacatttttt 


nfc a era era fc era 
y Lay ay a. uy d 


acrfcfcfcca ofca 
dy ULLLdLLd 


fc a fc t~ p c* t~* z^ci 
Ld l Ly l l Lay 


o _? «j *± \j 


rrr* t" ercrfc rhpp 
y l uy y ul u l l 


dCLw www y ct 


w w L»y ay wy d l. 


c r* fc c* c fc rir" r* +* 
LLLLL Uy U w U 


Ly y L L LLLLL 


appfcppfcppp " 




a t* t a pacrcipcr 


hna orr»r*a p ppr 
uy ciy w wCiw v_y 


1* cr c 1 r* t" ctci r* c t* 
u»y ww u>yy ww u 


nnnnt* a fc fc ert" 

yyyy udu uy l 


ot"fcr't*fcafc rrrr 
LLLLL LdLyy 


pappfcpapfcp 
LdLL uy au Ly 


J D vj 


fc cr a fc crcrcr c c p 


t~ crcr cr a a crcra a 

yyy y y a.d 


y t^cxy wdyddy 


a crcTCrfc hphf*p 
ay y y u uu u u w 


fc fc aa fc fc fc r* r* fc 

L Lyy L L LLL L 


ppra pa pfc a a p 
y y d l ay u aau 


COCOA 


fc cr a cr fc cr fc i" a fc 

uyay i— u»w L 


y y ciy y ll l Ld 


CTCTCT C* C t" CTCT C* f" 

yyywwwyyw l. 


fc fc crfc fc fc a fTfrn 
u L.y u u Lay y y 


dLdddy yy dd 


p fc p/p/fc a a p pa 
L Ly y LddLLd 


cqcon 
3 i?3 O 17 


y aay i^L-y ciy a. 


Pfc fct~aa£3Pap 
y tL LaaaL.au 


wu.dwL.ywww l. 


■fr* ofc fc ppplTrr' 
u u. u u w w u uy w 


L Lw cy L Ly L L 


p/p a ^3 pt /-^ ^ <~« /-i 
y LddLL Cay C 




fc fcaa ppa pp p 
u wddw L>ay w w 


rj p/-ra rrt~ pp t~ a 
ctyy ay tyL Ld 


yyaaLLLaay 


Ccty yy ccccl 


y dy LdLdLdy 


p a rfrrr' a p"P 4~ /~» 
CayyCayC ll 


c 0*7 Pin 


auyaaLUL-UL. 


fc fi-f-rrhnf fc 

L» L. L L w l uy u u 


w w ww w l. L.yyy 


ay c Lgggagg 


dLLULddwCd 


y y l aa u day a 


Df/Dy 


y» uy y LdL Ly 


=a pr" a ctpp afrr" 
dywdywwdyw 


t a at- 1- 1- f- 1- 1- a 

LCICIL L L L L. L>d 


aat"i~ , act*t"t"a 
ddUwdwUuUd 


4~ fc *~t+* t" fc a a r"* p 
L Ly L L LddLL 


a fc a fc p/a p fc pa 
d Ld Ly dL L Ld 


COOT n 


cccactt"aaa 

www Ct W L. U CLCLCL 


a a a a acr t a p a 
»«ay y y taLa 


y w w wdy uyyy 


t fc fc fc a crfc cifc a 
u u u Lay uy u a 


fc fc p a p a p/a fc p; 
l LLdL dy d uy 


fc crfc ctp aarrp 
uy Ly LddLLL 


coppn 


tcaccaracrf 


fc a a 1 1 1" t aera 

w act u w u u ay CL 


CL w CL w L. L. L> w w L. 


pre* c c c fc a a a a 

yLLLL Ldddd 


p;a a a p fc pfc ctp 
ydddL ll uy l 


a fc pa app pap 
a uy aay l uay 




ctotttttaa 

w uy u tuuucia 


at*f" acrna a a cr 

CL U U CL>H w CL CL CL y 


L, ' — d w Lj w uy w d 


fcr*c*fcfcfcaaafc 

LLLLLUdddU 


a fca fcpj"fc fc pa fc 

dLdUy ULLdL 


pcrfc a e* 2i a 
yy udLddddU 


O UUUu 


hraaaa era i~ a 

u u aaaay a. Ld 


Lay aay ay ll 


L. y w ay LLLaci 


aaarra rtrrrr 
dy dy dL u L l y 


l LLLLd uy dL 


pn p a appapp 
y ll aay l ay y 


VJvUuU 


p a 1" p p p fc rrrr p 


a ppp a t~ pp/p p 

yy yy w 


l. w w uy Lay L.y 


fc cr fc fc fc n fc fc r* fc 

LyLLLLLLLL 


duyuLLLLLL 


a nrrrrrri* p a fc p 
a 999a LwdLL 




fc.CjfcaPafca ffl 
wy L*CLwCl wOL Ly 


p a a ppa fc a pa 


»y ciy L-y tyya 


cfcfcfccrfcfcfcfcr« 

L U U Ly LLLLL 


Lddy l Lay dd 


pa fc a a fc fc pfc a 
y dLddL uy Ld 


fifii fin 


gatttatgtg 


cagttgtgag 


aaagagcaca 


gacccattta 


tcctctgcct 


ggtttccccc 


60240 


agtgctgcct 


gccatcttgc 


atgacttcca 


ttcctatcat 


aagcaagaca 


ctgataacga 


60300 


ttctttcacc 


ttattcagat 


tgacataagt 


gttttttgtt 


tgttcttgag 


acaaacttcc 


60360 


tctgtcaccc 


agtgggagtg 


cagtggcaca 


atcacagctc 


actgcagcct 


caaactcctg 


60420 



BNSDOCID: <W0 0177327A1_I_> 





WO 01/77327 



PCT/US00/16951 



188 



ggctcaagcg attctcctgc ctcagtcccc tcaagtagct cagatggcag gtgtgcacca 
tcatgccagg ctaattttta aattttttgt ggaggtgagg cctcactaaa tttcctgggc 
tagtcttgaa ctcctgagct aaagtgatcc tcctgcctca gcctcccaaa gtggtaggat 
tacaggcatg agccactgcg cctgggctga catatgtgtt ttcgtaagcc cgaaagatag 
catctgaaga gtcaacattg agccttgcct tttgctgcta acgatgtata aaagctgctg 
ttctgagcat ttcggaggct cccagctgcc gtgtgcaccc tgcctagagc tctaccgtaa 
cccatctccg ggaggaggtg ctattgtttt cctcattttg caacaaggag gctgaagaac 
tgagcatgaa ccactggcct gggtcgttcg gttggtaggc agtggggcca ggccatccaa 
ctcacaacca ccttctactc tgcttccccc gcaccctgaa gtttgttctg ttttgaggac 
acagccgtca cattcttggt ggctgaacag cactccttgt caggcgtggc tgggccccca 
ctggagggca tcatggtcct ctctcctgct gcggttgaac cttggctgtt tcaaccactc 
ctgccaagtg gccctctgaa agggacagtc catcttttct cagcagaggg ccacactggc 
aaaacggtcc ctggcaccct ttctctccac ctgtctaata tagagtaaaa atggtatcat 
gttaagatct tcatttatat ttattttatc atgaatgatg taagcatcat tttgtgtgtt 
taagaacctt tgggcccagc gtgatggctt gcagctgtaa tctcagcact ttaggaggct 
gagatgagcg gatcacttga ggccgggagt ttgagaccag cctggccaac atggagaaac 
cccgtctcta gtaaaaattt aaaaattagc cgggtatggt gatcccagct acttgggagt 
ctgaagcatg agaattgctt gaacatggga ggcggaggtt gcagtgagcc gagatcgcgc 
cattgcactc cagcctgggc gacagagcga gactctgtct caaaaaaaaa aaaaaaaaag 
aaaagaaaag aaattatcaa tctcctcttt tatggcatat atatatatat atatatatat 
atatatatat atatatattt tttttttttg gttatgttca gaaaggcctt ccctgctctg 
atcataaaaa acaacttatt ttcacactct ctctcttttt tttttgagac agagttttgc 
tcctgttgcc caggctggag tgcagtggcg caatctcagc tcactgtaac ctccgcctcc 
cgggttggag tgattctcct gccttacctt cccgagtagc tgggattata ggcatgcacc 
accatgcctg gctaattttg tacttttagt agagacgggg gtttctccat gttggtcagg 
ctggtctcga actcgcgacc tcaggtgatc cacccacctc ggcctcccaa agtgctggga 
ttacagacgt gagccaccat gcccagccca cactctcttt cttaacgtcc tcctcctttc 
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ttctgggaac accccctacc tggctggact gttgtgaagt ttaaataagc caggttaact 
tcacctcctc ctttaagaca cagctcagac actgcctcct ccaagaagcc ccctctggct 
tcctgtgtga atatgacggc cctctgggct ctagggtatc ttagaacaat gcttccttat 
ggctttggaa ccccgctgtc tcctggattg ggagcaaatg caggggagga gccacacctg 
actaatctct gggtctccca gcacataagt ggcataaggg cagggctgtg cccgcttcag 
gcacttactg aaggatgtac ttggcagagg gtaggcagcc ggcggatgag cccctcactc 
tccccagctg actgcgtggg cgggaaaggc gggttcagga gacccagcct ccctgggctg 
tcaccacctc tgcacatcca gccccattga tcaagggttc aatttttggg gtcctgttgg 
gaggccagga gactctctcc aggcacttct tccaggtctt tgtgttaggg tgtgtgtgtg 
tgtgtgtgtg tgtgtgtgtg tgtgttgttt gttttatttt atttatttat ttatttattt 
atttatttat ttatttattt tgagacgcag tctcgctctg ttgcccaggt tggagggtgg 
tggcatgatc tcggctcact gcaagctccg cctcccgggt tcacgccatt ctcctgcctc 
actcttcctg agtagccgga ttacaggcgc acgcaccatg cctggctaat tattttgttt 
ttttagtaga gacagggttt cgccacgttg cccaggctgg tcttgaatcc ctggcctcaa 
gcgatccgcc cgcctcagcc tcccaaagtg ctgggattac aggcgtgagc caccgtgccc 
gcccagccta ggggtacatg aaactttttt tttttttttt ttgagacaga gtttcactct 
gtcctcaggc tggagtgcag tggcgtgatc tcggcgtact gcaatctccg cctcccggtt 
caagcgattc tcctgcctca gcctcccgag tagctgggat tgcaggcacg cgccaccaca 
cccagctaat ttttgtattt ttagtagaga cgggctttca ccatgtggga caggatggtc 
tcgatctcct gacctcgtga tccgcccgcc tcagcctccg aaagtgctgg gattacaggc 
ctgagccacc gtgcccagcc atgatgtttt gatacaggca tataacgtat aataatcaca 
tcagggtaaa tgatgtaacc atcacatcaa gcatttatcc tttgtgttac aaaaaaaaat 
ctaattatac tttcctactt attctttttt tttttttttt ttgagacgga gtctccctca 
gtcgcccagg ctggagtgca gtggcatgat ctcagttcac tgcaagctct gcctcctagc 
tctgcctcct gggttcatgc cattctcctg tctcagcctc gcgagtagct gggactacag 
gcgcctgcca ccgtgcccgg ctaatttttt tttttgtatt tttggtagag acagggtttc 
accgtgttag ccaggatggt ctcgatctcc tgacctcata atccgcccgt ctcggcctcc 
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tcctcttcta ttgcacatcc tcttctattg cac 
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<210> 12 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 12 

ctgagcggaa ttcgtgagac c 21 

<210> 13 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 13 

ttggtctcac gtattccgct cga 23 

<210> 14 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 14 

ctcgagaatt ctggatcctc 2 0 



<210> 15 
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<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 15 

ttgaggatcc agaattctcg ag 22 

<210> 16 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 16 

tgtatgcgaa ttcgctgcgc g 21 

<210> 17 
<211> 23. 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 17 

ttcgcgcagc gaattcgcat aca 23 

<210> 18 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Artificial sequence is a primer. f 
<400> 18 

gtccactgaa ttctcagtga g 21 

<210> 19 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 19 

ttgtcactga gaattcagtg gac 23 

<210> 20 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 20 

gaatccgaat tcctggtcag c 21 

<210> 21 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 21 
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ttgctgacca ggaattcgga ttc 23 



<210> 22 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 22 

cuacuacuac uactgagcgg aattcgtgag acc 33 

<210> 23 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 23 

cuacuacuac uactcgagaa ttctggatcc tc 32 

<210> 24 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 24 

cuacuacuac uatgtatgcg aattcgctgc gcg 3 3 

<210> 25 
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<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 25 

cuacuacuac uagtccactg aattctcagt gag 33 

<210> 26 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 26 

cuacuacuac uagaatccga attcctggtc age 33 

<210> 27 
<211> 45 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 27 

aactggaaga attcgcggcc gcaggaattt tttttttttt ttttt 45 

<210> 28 
<211> 13 
<212> DMA 

<213> Artificial Sequence 
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<220> 



<223> Artificial sequence is a primer. 



<400> 28 
aattcggcac gag 



13 



<210> 29 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 29 

ctcgtgccg 9 

<210> 30 
<211> 14 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 30 

gtacgacggc cagt 14 

<210> 31 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 31 
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aacagctatg accatg 16 

<210> 32 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 32 

ccaagttctg agaagtcc 18 

<210> 33 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 33 

aatacctgaa accatacctg 20 

<210> 34 
<211> 57 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 34 

agctgctcgt agctgtctct ccctggatca cgggtacatg tactggacag actgggt 57 
<210> 35 
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<211> 56 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 35 

- tgagacgccc ggattgagcg ggcagggata gcttattccc tgtgccgcat tacggc 56 

<210> 36 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 36 

agctgctcgt agctgtctct ccctgga 27 

<210> 37 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 37 

gccgtaatgc ggcacaggga ataagct 27 

<210> 38 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Artificial sequence is a primer. 
<400> 38 

gagaggctat atccctgggc 

<210> 39 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 39 

acagcacgtg tttaaagggg 

<210> 40 

<211> 163 

<212> DNA 

<213> Homo sapiens 

<400> 40 

actaaagcgc cgccgccgcg ccatggagcc cgagtgagct cggcgcgggc ccgtccggcc 
gccggacaac atggaggcag ctccgcccgg . gccgccgtgg ccgctgctgc tgctgctgct 
gctgctgctg gcgctgtgcg gctgcccggc ccccgccgcg gcc 

<210> 41 

<211> 419 

<212> DNA 

<213> Homo sapiens 

<400> 41 

gccccacagc ctcgccgctc ctgctatttg ccaaccgccg ggacgtacgg ctggtggacg 
ccggcggagt caagctggag tccaccatcg tggtcagcgg cctggaggat gcggccgcag 
tggacttcca gttttccaag ggagccgtgt actggacaga cgtgagcgag gaggccatca 
agcagaccta cctgaaccag acgggggccg ccgtgcagaa cgtggtcatc tccggcctgg 
tctctcccga cggcctcgcc tgcgactggg tgggcaagaa gctgtactgg acggactcag 
agaccaaccg catcgaggtg gccaacctca atggcacatc ccggaaggtg ctcttctggc 



20 



60 

120 

163 



60 

120 

180 

240 

300 

360 
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aggaccttga ccagccgagg gccatcgcct tggaccccgc tcacgggtaa accctgctg 419 



<210> 42 
<211> 221 
<212> DNA 

<213> Homo sapiens 
<400> 42 

ccccgtcaca ggtacatgta ctggacagac tggggtgaga cgccccggat tgagcgggca 60 
gggatggatg gcagcacccg gaagatcatt gtggactcgg acatttactg gcccaatgga 120 
ctgaccatcg acctggagga gcagaagctc tactgggctg acgccaagct cagcttcatc 180 
caccgtgcca acctggacgg ctcgttccgg taggtaccca c 221 

<210> 43 
<211> 221 
<212> DNA 

<213> Homo sapiens 
<400>. 43 

tccctgactg caggcagaag gtggtggagg gcagcctgac gcaccccttc gccctgacgc 60 
tctccgggga cactctgtac tggacagact ggcagacccg ctccatccat gcctgcaaca 120 
agcgcactgg ggggaagagg aaggagatcc tgagtgccct atactcaccc atggacatcc 180 
aggtgctgag ccaggagcgg cagccttttt gtgagtgccg g 221 

<210> 44 

<211> 156 

<212> DNA 

<213> Homo sapiens 

<400> 44 

tttctcagtc cacactcgct gtgaggagga caatggcggc tggtcccacc tgtgcctgct 60 
gtccccaagc gagccttttt acacatgcgc ctgccccacg ggtgtgcaga tgcaggacaa 12 0 
cggcaggacg tgtaaggcag gtgaggcggt gggacg 156 

<210> 45 
<211> 416 
<212> DNA 
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<213> Homo sapiens 
<400> 45 

ctccacagga gccgaggagg tgctgctgct ggcccggcgg acggacctac ggaggatctc 

gctggacacg ccggacttca ccgacatcgt gctgcaggtg gacgacatcc ggcacgccat 

tgccatcgac tacgacccgc tagagggcta tgtctactgg acagatgacg aggtgcgggc 

catccgcagg gcgtacctgg acgggtctgg ggcgcagacg ctggtcaaca ccgagatcaa 

cgaccccgat ggcatcgcgg tcgactgggt ggcccgaaac ctctactgga ccgacacggg 

cacggaccgc atcgaggtga cgcgcctcaa cggcacctcc cgcaagatcc tggtgtcgga 

ggacctggac gagccccgag ccatcgcact gcaccccgtg atggggtaag acgggc 



<210> 46 
<211> 198 
<212> DNA 

<213> Homo sapiens 
<400> 46 

ttcttctcca gcctcatgta ctggacagac tggggagaga accctaaaat cgagtgtgcc 
aacttggatg ggcaggagcg gcgtgtgctg gtcaatgcct ccctcgggtg gcccaacggc 
ctggccctgg acctgcagga ggggaagctc tactggggag acgccaagac agacaagatc 
gaggtgaggc tcctgtgg 



<211> 244 
<212> DNA 

<213> Homo sapiens 
<400> 47 

ccgtcctgca ggtgatcaat gttgatggga cgaagaggcg gaccctcctg gaggacaagc 
tcccgcacat tttcgggttc acgctgctgg gggacttcat ctactggact gactggcagc 
gccgcagcat cgagcgggtg cacaaggtca aggccagccg ggacgtcatc attgaccagc' 
tgcccgacct gatggggctc aaagctgtga atgtggccaa ggtcgtcggt gagtccgggg 
ggtc 

<210> 48 
<211> 313 
<212> DNA 

<213> Homo sapiens 
<400> 48 

gttcgcttcc aggaaccaac ccgtgtgcgg acaggaacgg ggggtgcagc cacctgtgct 



60 

120 

180 

240 

300 

360 

416 



60 
120 
180 
198 



60 

120 

180 

240 

244 
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tctgcacacc ccacgcaacc cggtgtggct gccccatcgg cctggagctg ctgagtgaca 12 0 

tgaagacctg catcgtgcct gaggcctttt tggtcttcac cagcagagcc gccatccaca 180 

ggatctccct cgagaccaat aacaacgacg tggccatccc gctcacgggc gtcaaggagg 240 

cctcagccct ggactttgat gtgtccaaca accacatcta ctggacagac gtcagcctga 3 00 

aggtagcgtg ggc 313 



<210> 49 

<211> 255 

<212> DNA 

<213> Homo sapiens 

<400> 49 

cctgctgcca gaccatcagc cgcgccttca tgaacgggag ctcggtggag cacgfcggtgg 60 
agtttggcct tgactacccc gagggcatgg ccgttgactg gatgggcaag aacctctact 120 
gggccgacac tgggaccaac agaatcgaag tggcgcggct ggacgggcag ttccggcaag 180 
tcctcgtgtg gagggacttg gacaacccga ggtcgctggc cctggatccc accaaggggt 240 
aagtgtttgc ctgtc 255 

<210> 50 
<211> 210 
<212> DNA 

<213> Homo sapiens 
<400> 50 

gtgccttcca gctacatcta ctggaccgag tggggcggca agccgaggat cgtgcgggcc 60 
ttcatggacg ggaccaactg catgacgctg gtggacaagg tgggccgggc caacgacctc 120 
accattgact acgctgacca gcgcctctac tggaccgacc tggacaccaa catgatcgag. 180 
tcgtccaaca tgctgggtga gggccgggct 210 

<210> 51 

<211> 352 

<212> DNA 

<213> Homo sapiens 

<400> 51 

gtgttcatgc aggtcaggag cgggtcgtga ttgccgacga tctcccgcac ccgttcggtc 60 
tgacgcagta cagcgattat atctactgga cagactggaa tctgcacagc attgagcggg 12 0 
ccgacaagac tagcggccgg aaccgcaccc tcatccaggg ccacctggac ttcgtgatgg 180 
acatcctggt gttccactcc tcccgccagg atggcctcaa tgactgtatg cacaacaacg 240 
ggcagtgtgg gcagctgtgc cttgccatcc ccggcggcca ccgctgcggc tgcgcctcac 300 
actacaccct ggaccccagc agccgcaact gcagccgtaa gtgcctcatg gt 352 
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<210> 52 

<211> 225 

<212> DNA 

<213> Homo sapiens 

<400> 52 

gcctcctcta , cgcccaccac cttcttgctg ttcagccaga aatctgccat cagtcggatg 
atcccggacg accagcacag cccggatctc atcctgcccc tgcatggact gaggaacgtc 
aaagccatcg actatgaccc actggacaag ttcatctact gggtggatgg gcgccagaac 
atcaagcgag ccaaggacga cgggacccag gcaggtgccc tgtgg 



<210> 53 
<211> 235 
<212> DNA 

<213> Homo sapiens 
<400> 53 

ctttgtctta cagccctttg ttttgacctc tctgagccaa ggccaaaacc cagacaggca 
gccccacgac ctcagcatcg acatctacag ccggacactg ttctggacgt gcgaggccac 
caataccatc aacgtccaca ggctgagcgg ggaagccatg ggggtggtgc tgcgtgggga 
ccgcgacaag cccagggcca tcgtcgtcaa cgcggagcga gggtaggagg ccaac 



<210> 54 

<211> 218 

<212> DNA 

<213> Homo sapiens 

<400> 54 

ccaccctccc gcaggtacct gtacttcacc aacatgcagg accgggcagc caagatcgaa 
cgcgcagccc tggacggcac cgagcgcgag gtcctcttca ccaccggcct catccgccct 
gtggccctgg tggtggacaa cacactgggc aagctgttct gggtggacgc ggacctgaag 
cgcattgaga gctgtgacct gtcaggtacg cgccccgg 



60 
12 0 
180 
225 



60 
12 0 
180 
235 



60 
120 
180 
218 



<210> 55 



<211> 234 



<212> DNA 



<213> Homo sapiens 
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<400> 55 

ggctgcttgc aggggccaac cgcctgaccc tggaggacgc caacatcgtg cagcctctgg 60 

gcctgaccat ccttggcaag catctctact ggatcgaccg ccagcagcag atgatcgagc 120 

gtgtggagaa gaccaccggg gacaagcgga ctcgcatcca gggccgtgtc gcccacctca 180 

ctggcatcca tgcagtggag gaagtcagcc tggaggagtt ctgtacgtgg gggc 2 34 



<210> 56 
<211> 157 
<212> DNA 

<213> Homo sapiens 
<400> 56 

ttgtctttgc agcagcccac ccatgtgccc gtgacaatgg tggctgctcc cacatctgta 60 
ttgccaaggg tgatgggaca ccacggtgct catgcccagt ccacctcgtg- ctcctgcaga 12 0 
acctgctgac ctgtggaggt aggtgtgacc taggtgc .157 

<210> 57 

<211> 272 

<212> DNA 

<213> Homo sapiens 

<400> 57 

gttctcctct gtccctcccc cagagccgcc cacctgctcc ccggaccagt ttgcatgtgc 60 
cacaggggag atcgactgta tccccggggc ctggcgctgt gacggctttc ccgagtgcga 120 
tgaccagagc gacgaggagg gctgccccgt gtgctccgcc gcccagttcc cctgcgcgcg 180 
gggtcagtgt gtggacctgc gcctgcgctg cgacggcgag gcagactgtc aggaccgctc 240 
agacgaggtg gactgtgacg gtgaggccct cc 272 

<210> 58 

<211> 134 

<212> DNA 

<213> Homo sapiens 

<400> 58 

tctccttgca gccatctgcc tgcccaacca gttccggtgt gcgagcggcc agtgtgtcct 60 
catcaaacag cagtgcgact ccttccccga ctgtatcgac ggctccgacg agctcatgtg 12 0 
tggtgagcca gctt 134 

<210> 59 
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<211> 274 

<212> DNA 

<213> Homo sapiens 

<400> 59 

gtttgtctct ggcagaaatc accaagccgc cctcagacga cagcccggcc cacagcagtg 60 
ccatcgggcc cgtcattggc atcatcctct ctctcttcgt catgggtggt gtctattttg 120 
tgtgccagcg cgtggtgtgc cagcgctatg cgggggccaa cgggcccttc ccgcacgagt 180 
atgtcagcgg gaccccgcac gtgcccctca atttcatagc cccgggcggt tcccagcatg 240 
gccccttcac aggtaaggag cctgagatat ggaa 2 74 

<210> 60 
<211> 164 
<212> DNA 

<213> Homo sapiens 
<400> 60 

cttccctgcc aggcatcgca tgcggaaagt ccatgatgag ctccgtgagc ctgatggggg 60 
gccggggcgg ggtgcccctc tacgaccgga accacgtcac aggggcctcg tccagcagct 12 0 
cgtccagcac gaaggccacg ctgtacccgc cggtgagggg cggg 164 

<210> 61 

<211> 130 

<212> DNA 

<213> Homo sapiens 

<400> 61 

ttggctctcc tcagatcctg aacccgccgc cctccccggc cacggacccc tccctgtaca 60 
acatggacat gttctactct tcaaacattc cggccactgc gagaccgtac aggtaggaca 12 0 
tcccctgcag 13 0 

<210> 62 

<211> 496 

<212> DNA 

<213> Homo sapiens 

<400> 62 

tcaaacattc cggccactgc gagaccgtac aggccctaca tcattcgagg aatggcgccc 60 
ccgacgacgc cctgcagcac cgacgtgtgt gacagcgact acagcgccag ccgctggaag 120 



BNSDOCID: <WO 0177327A1 J_> 



WO 01/77327 



207 



PCT/US00/16951 



gccagcaagt 
ccccacagcc 
agctacttcc 
ccgggccact 
aatatatttt 
aactgtgatg 
ttaattttgt 



actacctgga 
agtacctgtc 
atctcttccc 
ctggcttctc 
atgatttaaa 

gggtgggcag 

aaaaca 



tttgaactcg 
ggcggaggac 
gccccctccg 
tgtgcccctg 
aaataaatat 

ggctgggaga 



gactcagacc 
agctgcccgc 
tccccctgca 
taaatagttt 
aattgggatt 
actttgtaca 



cctatccacc 
cctcgcccgc 
cggactcatc 
taaatatgaa 
ttaaaaacat 
gtggagaaat 



cccacccacg 
caccgagagg 
ctgacctcgg 
caaagaaaaa 
gagaaatgtg 
atttataaac 



180 
240 
300 
360 
420 
480 
496 



<210> 63 

<211> 3081 

<212> DMA 

<213> Homo sapiens 

<400> 63 

cccgccagcc cagcccagcc caaccctact ccctccccac gccagggcag cagccgfctgc 60 

tcagagagaa ggtggaggaa gaaatccaga ccctagcacg cgcgcaccat catggaccat 120 

tatgattctc agcaaaccaa cgattacatg cagccagaag aggactggga ccgggacctg 180 

ctcctggacc cggcctggga gaagcagcag. agaaagacat tcacggcatg gtgtaactcc 240 

cacctccgga aggcggggac acagatcgag aacatcgaag aggacttccg ggatggcctg 3 00 

aagctcatgc tgctgctgga ggtcatctca ggtgaacgct tggccaagcc agagcgaggc 3 60 

aagatgagag tgcacaagat ctccaacgtc aacaaggccc tggatttcat agccagcaaa 42 0 

ggcgtcaaac tggtgtccat cggagccgaa gaaatcgtgg atgggaatgt gaagatgacc 4 80 

ctgggcatga tctggaccat catcctgcgc tttgccatcc aggacatctc cgtggaagag 540 

acttcagcca aggaagggct gctcctgtgg tgtcagagaa agacagcccc ttacaaaaat 600 

gtcaacatcc agaacttcca cataagctgg aaggatggcc tcggcttctg tgctttgatc 660 

caccgacacc ggcccgagct gattgactac gggaagctgc ggaaggatga tccactcaca 720 

aatctgaata cggcttttga cgtggcagag aagtacctgg acatccccaa gatgctggat 780 

gccgaagaca tcgttggaac tgcccgaccg gatgagaaag ccatcatgac ttacgtgtct 840 

agcttctacc acgccttctc tggagcccag aaggcggaga cagcagccaa tcgcatctgc 900 

aaggtgttgg ccgtcaacca ggagaacgag cagcttatgg aagactacga gaagctggcc 960 

agtgatctgt tggagtggat ccgccgcaca atcccgtggc tggagaaccg ggtgcccgag 102 0 

aacaccatgc atgccatgca acagaagctg gaggacttcc gggactaccg gcgcctgcac 108 0 

aagccgccca aggtgcagga gaagtgccag ctggagatca acttcaacac gctgcagacc 114 0 

aagctgcggc tcagcaaccg gcctgccttc atgccctctg agggcaggat ggtctcggac 1200 

atcaacaatg cctggggctg cctggagcag gtggagaagg gctatgagga gtggttgctg 1260 

aatgagatcc ggaggctgga gcgactggac cacctggcag agaagttccg gcagaaggcc 1320 

tccatccacg aggcctggac tgacggcaaa gaggccatgc tgcgacagaa ggactatgag 13 80 
accgccaccc tctcggagat caaggccttg ctcaagaagc atgaggcctt cgagagtgac ' 1440 

ctggctgccc accaggaccg tgtggagcag attgccgcca tcgcacagga gctcaatgag 1500 

ctggactatt atgactcacc cagtgtcaac gcccgttgcc aaaagatctg tgaccagtgg 1560 

gacaatctgg gggccctaac tcagaagcga agggaagctc tggagcggac cgagaaactg 162 0 

ctggagacca ttgaccagct gtacttggag tatgccaagc gggctgcacc cttcaacaac 1680 

tggatggagg gggccatgga ggacctgcag gacaccttca ttgtgcacac cattgaggag 174 0 

atccagggac tgaccacagc ccatgagcag ttcaaggcca ccctccctga tgccgacaag 18 0 0 

gagcgcctgg ccatcctggg catccacaat gaggtgtcca agattgtcca gacctaccac 1860 

gtcaatatgg cgggcaccaa cccctacaca accatcacgc ctcaggagat caatggcaaa 192 0 

tgggaccacg tgcggcagct ggtgcctcgg agggaccaag ctctgacgga ggagcatgcc 1980 

cgacagcagc acaatgagag gctacgcaag cagtttggag cccaggccaa tgtcatcggg 2 040 

ccctggatcc agaccaagat ggaggagatc gggaggatct ccattgagat gcatgggacc 210 0 

ctggaggacc agctcagcca cctgcggcag tatgagaaga gcatcgtcaa ctacaagcca 2160 

aagattgatc agctggaggg cgaccaccag ctcatccagg aggcgctcat cttcgacaac 222 0 

aagcacacca actacaccat ggagcacatc cgtgtgggct gggagcagct gctcaccacc 22 80 
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atcgccagga ccatcaatga ggtagagaac cagatcctga cccgggatgc caagggcatc 
agccaggagc agatgaatga gttccgggcc tccttcaacc actttgaccg ggatcactcc 

ggcacactgg gtcccgagga gttcaaagcc tgcctcatca gcttgggtta tgatattggc 

aacgaccccc agggagaagc agaatttgcc cgcatcatga gcattgtgga ccccaaccgc 

ctgggggtag tgacattcca ggccttcatt gacttcatgt cccgcgagac agccgacaca 

gatacagcag accaagtcat ggcttccttc aagatcctgg ctggggacaa gaactacatt 

accatggacg agctgcgccg cgagctgcca cccgaccagg ctgagtactg catcgcgcgg 

atggccccct acaccggccc cgactccgtg ccaggtgctc tggactacat gtccttctcc 

acggcgctgt acggcgagag tgacctctaa tccaccccgc ccggccgccc tcgtcttgtg 

cgccgtgccc acagatgtga aatgaatgta atctaataga agcctaatca gcccaccatg 

ttctccactg aaaaatcctc tttctttggg gtttttcttt ctttcttttt tgattttgca 

ctggacggtg acgtcagcct gtacaggctc ccaggggtgg cgtcaaatgc tattgaaatt 

gcgctgaatc gtatgctttt tccttttgat aaataaacaa tgtaaaaatg tttcaaaaac 
ctaataaaat aaataaatac g 

<210> 64 
<211> 1324 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis cofeature 
<222> (1) . . . (1324) 
<223> n a A,T,C or G 



ggccgcccgg cgcccccagc agnccgagcc ggggcgcaca gncggggcgc agcccgcgcc 
ccccgccgcg attgacatga tgtttccaca aagcaggcat tcgggctcct cgcacctacc 
ccagcaactc aaattcacca cctcggactc ctgcgaccgc atcaaagacg aatttcagct 
actgcaagct cagtaccaca gcctcaagct cgaatgtgac aagttggcca gtgagaagtc 
agagatgcag cgtcactatg tgafcgtacta cgagatgtcc tacggcttga acatcgagat 
gcacaaacag gctgagatcg tcaaaaggct gaacgggatt tgtgcccagg tcctgcccta 
cctctcccaa gagcaccagc agcaggtctt gggagccatt gagagggcca agcaggtcac 
cgctcccgag ctgaactcta tcatccgaca gcagctccaa gcccaccagc tgtcccagct 
gcaggccctg gccctgccct tgaccccact acccgtgggg ctgcagccgc cttcgctgcc 
ggcggtcagc gcaggcaccg gcctcctctc gctgtccgcg ctgggttccc aggcccacct 
ctccaaggaa gacaagaacg ggcacgatgg tgacacccac caggaggatg atggcgagaa 
gtcggattag cagggggccg ggacagggag gttgggaggg gggacagagg ggagacagag 
gcacggagag aaaggaatgt ttagcacaag acacagcgga gctcgggatt ggctaatctc 
ccatagtatt tatggtggcg ccggcggggc cccagcccag cttgcaggcc acctctagct 
ttcttcctac cccattccgg cttccctcct cctcccctgc agcctggtta ggtggatacc 
tgccctgaca tgtgaggcaa gctaaggcct ggagggtcag atgggagacc aggtcccaag 
ggagcaagac ctgcgaagcg cagcagcccc ggcccttccc ccgttttgaa catgtgtaac 
cgacagtctg ccctgggcca cagccctctc accctggtac tgcatgcacg caatgctagc 
tgcccctttc ccgtcctggg caccccgagt ctcccccgac cccgggtccc aggtatgctc 
ccacctccac ctgccccact caccacctct gctagttcca gacacctcca cgcccacctg 
gtcctctccc atcgcccaca aaaggggggg cacgagggac gagcttagct gagctgggag 
gagcagggtg agggtgggcg acccaggatt ccccctcccc ttcccaaata aagatgaggg 
tact 

<210> 65 
<211> 2377 
<212> DNA 
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<213> Homo sapiens 
<400> 65 

ggtgacaaag agccaacaga gacaatagga gacttgtcaa tttgtcttga tgggctacag 
ttagagtctg aagttgttac caatggtgaa actacatgtt cagaaagtgc ttctcagaat 
gatgatggct ccagatccaa ggatgaaaca agagtgagca caaatggatc agatgaccct 
gaagatgcag gagctggtga aaataggaga gtcagtggga ataattctcc atcactctca 
aatggtggtt ttaaaccttc tagacctcca agaccttcac gaccaccacc acccacccca 
cgtagaccag catctgtcaa tggttcacca tctgccactt ctgaaagtga tgggtctagt 
acaggctctc tgccgccgac aaatacaaat acaaatacat ctgaaggagc aacatctgga 
ttaataattc ctcttactat atctggaggc tcaggcccta ggccattaaa tcctgtaact 
caagctccct tgccacctgg ttgggagcag agagtggacc agcacgggcg agtttactat 
gtagatcatg ttgagaaaag aacaacatgg gatagaccag aacctctacc tcctggctgg 
gaacggcggg ttgacaacat gggacgtatt tattatgttg accatttcac aagaacaaca 
acgtggcaga ggccaacact ggaatccgtc cggaactatg aacaatggca gctacagcgt 
agtcagcttc aaggagcaat gcagcagttt aaccagagat tcatttatgg gaatcaagat 
ttatttgcta catcacaaag taaagaattt gatcctcttg gtccattgcc acctggatgg 
gagaagagaa cagacagcaa tggcagagta tatttcgtca accacaacac acgaattaca 
caatgggaag accccagaag tcaaggtcaa ttaaatgaaa agcccttacc tgaaggttgg 
gaaatgagat tcacagtgga tggaattcca tattttgtgg accacaatag aagaactacc 
acctatatag atccccgcac aggaaaatct gccctagaca atggacctca gatagcctat 
gttcgggact tcaaagcaaa ggttcagtat ttccggttct ggtgtcagca actggccatg 
ccacagcaca taaagattac agtgacaaga aaaacattgt ttgaggattc ctttcaacag 
ataatgagct tcagtcccca agatctgcga agacgtttgt gggtgatttt tccaggagaa 
gaaggtttag attatggagg tgtagcaaga gaatggttct ttcttttgtc acatgaagtg 
ttgaacccaa tgtattgcct gtttgaatat gcagggaagg ataactactg cttgcagata 
aaccccgctt cttacatcaa tccagatcac ctgaaatatt ttcgttttat tggcagattt 
attgccatgg ctctgttcca tgggaaattc atagacacgg gtttttcttt accattctat 
aagcgtatct tgaacaaacc agttggactc aaggatttag aatctattga tccagaattt 
tacaattctc tcatctgggt taaggaaaac aatattgagg aatgtgattt ggaaatgtac 
ttctccgttg acaaagaaat tctaggtgaa attaagagtc atgatctgaa acctaatggt 
ggcaatattc ttgtaacaga agaaaataaa gaggaataca tcagaatggt agctgagtgg 
aggttgtctc gaggtgttga agaacagaca caagctttct ttgaaggctt taatgaaatt 
cttccccagc aatatttgca atactttgat gcaaaggaat tagaggtcct tttatgtgga 
atgcaagaga ttgatttgaa tgactggcaa agacatgcca tctaccgtca ttatgcaagg 
accagcaaac aaatcatgtg gttttggcag tttgttaaag aaattgataa tgagaagaga 
atgagacttc tgcagtttgt tactggaacc tgccgattgc cagtaggagg atttgctgat 
ctcatgggga gcaatggacc acagaaattc tgcattgaaa aagttgggaa agaaaattgg 
ctacccagaa gtcatacctg ttttaatcgc ctggacctgc caccatacaa gagctatgag 
caactgaagg aaaagctgtt gtttgccata gaagaaacag aaggatttgg acaagagtaa 
cttctgagaa cttgcaccat gaatgggcaa gaacttattt gcaatgtttg tccttctctg 
cctgttgcac atcttgtaaa attggacaat ggctctttag agagttatct gagtgtaagt 
aaattaatgt tctcatttaa aaaaaaaaaa aaaaaaa 

<210> 66 
<211> 1295 
<212> DNA 

<213> Homo sapiens 



gggccgccgc ccacccgggc cttgcctcta cctcagtcgt tgccccccga ttttcggctg 
gagcccacgg ccccggccct cagcccccgc tctagcttcg ccagtagctc ggccagcgac 
gcgagcaagc cgtccagccc ccggggcagc ctgctgctgg acggggcggg ggctggcgga 
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gctggaggta gccggccctg cagcaatcgc accagcggca tcagcatggg ctacgaccag 
cgccacggga gccccttgcc agcggggccg tgcctgtttg gcccacccct ggccggagca 
ccggcaggct attctcccgg aggggtcccg tccgcctacc cggagctcca cgccgccctg 
gaccgattgt acgctcagcg gcccgcgggg ttcggctgcc aggaaagccg ccactcgtat 
cccccggccc tgggcagccc tggagctcta gccggggccc gagtgggagc ggcggggccc 
ttggagagac ggggggcgca acccggacga cactctgtga ccggctacgg ggactgcgcc 
gtgggcgccc ggtaccagga cgagctaaca gctttgcttc gcctgacggt gggcaccggt 
gggcgagaag ccggagcccg cggagaaccc tcggggattg agccgtcggg tctggaggag 
ccaccaggtc ctttcgttcc ggaggccgcc cgggcccgga tgcgggagcc agaggccagg 
gaggactact tcggcacctg tatcaagtgc aacaaaggca tctatgggca gagcaatgcc 
tgccaggccc tggacagcct ctaccacacc cagtgctttg tttgctgctc ttgtgggcga 
actttgcgtt gcaaggcttt ctacagtgtc aatggctctg tgtactgtga ggaagattat 
ctgttttcag ggtttcagga ggcagctgag aaatgctgtg tctgtggtca cttgattttg 
gagaagatcc tacaagcaat ggggaagtcc tatcatccag gctgtttccg atgcattgtt 
tgcaacaagt gcctggatgg catccccttc acagtggact tctccaacca agtatactgt 
gtcaccgact accacaaaaa ttatgctcct aagtgtgcag cctgtggcca acccatcctc 
ccctctgagg gctgtgagga catcgtgagg gtgatatcca tggaccggga ttatcacttt 
gagtgctacc actgtgagga ctgccggatg cagctgagtg atgaggaagg ctgctgctgt 
ttccctctgg atgggcactt gctctgccat ggttg 

<210> 67 
<211> 3411 . 
:<212> DNA 
<213> Homo sapiens 



gggcccgggg tcccgccacc accgcgcgcg ggacagattg attcactttg gagctgtaag 
tactgatgta ttagggtgca gcgctcattg ttcattgacg cagagtccca aaatgaatat 
ccaagagcag ggtttcccct tggacctcgg agcaagtttc accgaagatg ctccccgacc 
cccagtgcct ggtgaggagg gagaactggt gtccacagac ccgaggcccg ccagctacag 
tttctgctcc gggaaaggtg ttggcattaa aggtgagact tcgacggcca ctccgaggcg 
ctcggatctg gacctggggt atgagcctga gggcagtgcc tcccccaccc caccatactt 
gaagtgggct gagtcactgc attccctgct ggatgaccaa gatgggataa gcctgttcag 
gactttcctg aagcaggagg gctgtgccga cttgctggac ttctggtttg cctgcactgg 
cttcaggaag ctggagccct gtgactcgaa cgaggagaag aggctgaagc tggcgagagc 
catctaccga aagtacattc ttgataacaa tggcatcgtg tcccggcaga ccaagccagc 
caccaagagc ttcataaagg gctgcatcat gaagcagctg atcgatcctg ccatgtttga 
ccaggcccag accgaaatcc aggccactat ggaggaaaac acctatccct ccttccttaa 
gtctgatatt tatttggaat atacgaggac aggctcggag agccccaaag tctgtagtga 
ccagagctct gggtcaggga cagggaaggg catatctgga tacctgccga ccttaaatga 
agatgaggaa tggaagtgtg accaggacat ggacgaggac gatggcagag acgctgctcc 
ccccggaaga ctccctcaga agctgctcct ggagacagct gccccgaggg tctcctccag 
tagacggtac agcgaaggca gagagttcag gtatggatcc tggcgggagc cagtcaaccc 
ctattatgtc aatgccggct atgccctggc cccagccacc agtgccaacg acagcgagca 
gcagagcctg tccagcgatg cagacaccct gtccctcacg gacagcagcg' tggatgggat 
ccccccatac aggatccgta agcagcaccg cagggagatg caggagagcg cgcaggtcaa 
fc gggcgggtg cccctacctc acattccccg cacgtaccgg gtgccgaagg aggtccgcgt 
ggagcctcag aagttcgcgg aggagctcat ccaccgcctg gaggctgtgc agcgcacgcg 
ggaggccgag gagaagctgg aggagcggct gaagcgcgtg cgcatggagg aggaaggtga 
ggacggcgat ccatcgtcag ggcccccagg gccgtgtcac aagctgcctc ccgcccccgc 
ttggcaccac ttcccgcccc gcttgtgttg gacatgggct tgtgccgggc tccgggatgc 
acacgaggag aaccctgaga gcatcctgga cgagcacgta cagcgtgtgc tgaggacaac 
tggccgccag tcgcctgggc ctggccatcg ctccccggac agtgggcacg tggccaagat 
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gccagtggca ctggggggtg ccgcctcggg" gcacgggaag cacgtaccca agtcaggggc 1680 

gaagctggac gcggccggcc tgcaccacca ccgacacgtc caccaccacg tccaccacag 1740 

cacagcccgg cccaaggagc aggtggaggc cgaggccacc cgcagggccc agagcagctt 18 00 

cgcctggggc ctggaaccac acagccatgg ggcaaggtcc cgaggctact cagagagtgt 1860 

tggcgctgcc cccaacgcca gtgatggcct cgcccacagt gggaaggtgg gcgttgcgtg 192 0 

caaaagaaat gccaagaagg ctgagtcggg gaagagcgcc agcaccgagg tgccaggtgc 198 0 

ctcggaggat gcggagaaga accagaaaat catgcagtgg atcattgagg gggaaaagga 2 04 0 

gatcagcagg caccgcagga ccggccacgg gtcttcgggg acgaggaagc cacagcccca 210 0 

tgagaactcc agaccyttgt cccttgagca cccctgggcc ggccctcagc tccggacctc 2160 

cgtgcagccc tcccacctct tcatccaaga ccccaccatg ccaccccacc cagctcccaa 2220 

ccccctaacc cagctggagg aggcgcgccg acgtctggag gaggaagaaa agagagccag 2280 

ccgagcaccc tccaagcaga ggtatgtgca ggaggttatg cggcggggac gcgcctgcgt 234 0 

caggccagcg tgcgcgccgg tgctgcacgt ggtaccagcc gtgtcggaca tggagctctc 240 0 

cgagacagag acaagatcgc agaggaaggt gggcggcggg agtgcccagc cgtgtgacag 2460 

catcgttgtg gcgtactact tctgcgggga acccatcccc taccgcaccc tggtgagggg 252 0 

ccgcgctgtc accctgggcc agttcaagga gctgctgacc aaaaagggca gctacagata 258 0 

ctacttcaag aaagtgagcg acgagtttga ctgtggggtg gtgtttgagg aggttcgaga 2640 

ggacgaggcc gtcctgcccg tctttgagga gaagatcatc ggcaaagtgg agaaggtgga 2700 

ctgataggct ggtgggctgg ccgctgtgcc aggcgaggcc cttggcgggc acgggtgtca . 2760 

cggccaggca gatgacctcg tactcaggag cccgatgggg aacagtgttg ggtgtaccac 2820 

ccatccctgt ggtctacccg tgtctagagg caggtagggg gtccctccaa gtggtccaca ■" 288 0 

agcttctgtc ctgcccccaa ggaggcagcc tggaccactc ctcatagcaa tacttggagg 294 0 

gcccagccca agtgaggcag ccgaggtccc tgctgccagc ttcaggtgac ccccccccat 3000 

cccccggcac ctcccttggg cacgtgtgct gggatctact ttccctctgg gatttgccca 3060 

cgtacccagg tctggctggg gcccaggccc ggatgcagag gcctgcaggg cctctgtcaa 312 0 

ttgtacgcgc caccaagtgc cttcaacaca gcttgtctct tgcctgccac tgtgtgaatc 318 0 

ggcgacggag cactgcacct gcctccagcc gccggctgtg cagtcctggg tcctcctttc 3240 

tgagggcccg tgtaaatatg tacatttctc aggctagggc cagcaggggc tgcccgagtc 33 0 0 

tgtttttcat gcgatgacac ttgtacaatt atcttttcaa aggtacttgg ataataatga 3360 

aataaaactg tttttgaacc tgaataaaaa aaaaaaaaaa aaaaaaaaaa a 3411 

<210> 68 
<211> 3140 
<212> DNA 

<213> Homo sapiens 
<400> 68 

ggctgcgagt acctccatgg tcccggtggc tgtgacggcg gcagtggcgc ctgtcctgtc 60 

cataaacagc gatttctcag atttgcggga aattaaaaag caactgctgc ttattgcggg 12 0 

ccttacccgg gagcggggcc tactacacag tagcaaatgg tcggcggagt tggctttctc 18 0 

tctccctgca ttgcctctgg ccgagctgca accgcctccg cctattacag aggaagatgc 24 0 

ccaggatatg gatgcctata ccctggccaa ggcctacttt gacgttaaag agtatgatcg 30 0 

ggcagcacat ttcctgcatg gctgcaatag caagaaagcc tattttctgt atatgtattc 360 

cagatatctg tctggagaaa aaaagaagga cgatgaaaca gttgatagct taggccccct 42 0 

ggaaaaagga caagtgaaaa atgaggcgct tagagaattg agagtggagc teagcaaaaa 48 0 

acaccaagct cgagaacttg atggatttgg actttatctg tatggtgtgg tgcttcgaaa 54 0 

actggacttg gttaaagagg ccattgatgt gtttgtggaa gctactcatg ttttgccctt 60 0 

gcattgggga gcctggttag aactctgtaa cctgatcaca gacaaagaga tgctgaagtt 66 0 

"cctgtctttg ccagacacct ggatgaaaga gttttttctg gctcatatat acacagagtt 72 0 

gcagttgata gaggaggccc tgcaaaagta tcagaatctc attgatgtgg gcttctctaa 78 0 

gagctcgtat attgtttccc aaattgcagt tgcctatcac aatatcagag atattgacaa 84 0 

agccctctcc atttttaatg agctaaggaa acaagaccct tacaggattg aaaatatgga 90 0 

cacattctcc aaccttcttt atgtcaggag catgaaatcg gagttgagtt atctggctca 960 
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taacctctgt gagattgata aataccgtgt agaaacgtgc tgtgtaattg gcaattatta 102 0 

cagtttacgt tctcagcatg agaaagcagc cttatatttc cagagagccc tgaaattaaa 1080 

tcctcggtat cttggtgcct ggacactaat gggacatgag tacatggaga tgaagaacac 1140 

gtctgctgct atccaggctt atagacatgc cattgaggtc aacaaacggg actacagagc 120 0 

ttggtatggc ctcgggcaga cctatgaaat ccttaagatg ccattttact gcctttatta 1260 

ttatagacgg gcccaccagc ttcgacccaa tgattctcgc atgctggttg ctttaggaga 1320 

atgttacgag aaactcaatc aactagtgga agccaaaaag tgttattgga gagcttacgc 1380 

cgtgggagat gtggagaaaa tggctctggt gaaactggca aagcttcatg aacagttgac 144 0 

tgagtcagaa caggctgccc agtgttacat caaatatatc caagatatct attcctgtgg 150 0 

ggaaatagta gaacacttgg aggaaagcac tgcctttcgc tatctggccc agtactattt 1560 

taagtgcaaa ctgtgggatg aagcttcaac ttgtgcacaa aagtgttgtg catttaatga 1620 

tacccgggaa gaaggtaagg ccttactccg gcaaatccta cagcttcgga accaaggcga 1680 

gactcctacc accgaggtgc ctgctccctt tttcctacct gcttcactct ctgctaacaa 1740 

tacccccaca cgcagagttt ctccactcaa cttgtcttct gtcacgccat agttggctac 1800 

tctcaagcca gcacafctgtt agacccatct taattaagcc ttacctccat gtaaagaaca 1860 

gcacgtctgt tccaaggacc tcagctcttc ttgtttctac agatggcaac agctccatag 192 0 

ggacagcttg tataattacc ttcagaggcc aactgacaga atcctggcag gaacagacat 1980 

tatcttgcca gttagaagta cttctgtctc acttatgtcc aaagagtggc tatagatctt 2 040 

ggccttcttc cctgaatggt tttttttttt ggcccccaag aaagtccctt ttatagcact 2100 

tfcagcacagg caatgctaca ggaacaaagt ttcaatgctg ctgagagtga aagaaaggag 2160 

gaaagtctgc cactctaccc tgagctggca gtagggcact gagtaccctt aggaagaagt 222 0 

cagagcaatg gatacaaatg accttgctct tggatttgct gagcatgatc cctattctga 2280 

tgtcagagat taggtttaaa tggaatagag ctatccattt gttcttactc tctagggaga 2340 

caatcttcca aaacagtttt gggggggtct tctaaagctt tcaaattgga agtaacttta 2400 

ttcaactaga gttgaataaa agaagggcaa aaataatctc acagagcttg gaactgctga 2460 

tagcccttac tgagggcaaa agatggctat attgttagct atactcctac caaagcaagc 2520 

aaggagatag gattatagat aatttcacgg acatttggaa ataacattgg tgattataca 2580 

gacaagaata aactcacttc aagctggtct gttttaataa attttcaacg taattgtcta 2640 

tttttttccc tcccatctgc aacagaatac atttttttca gcctttatct agatgaggta 2700 

aagggaatca ttcttatggt gctcttggag agtttcaggc ctgtgcatgt gtgtacagca 2760 

ggaggtaata tgctataatg tctgctgtaa tatatttgca cagtagatgc tatggatcat 2820 

tctgagctca gggtccagac tttattctta ttcccagaat tttgtgttac gtttttacct 2880 

cctaacatat gacacttcat cttatattaa ggaaggttta gaatatctaa tacgacttga 2940 

attcatttgt tactaagcct tctcaggcaa gctgtatact agttactggt ctccactgcc 3000 

atgccttttc aaggttccca tggtccagaa tgatgtttga ttcttaattt ttctgtccct 3060 

tttataattt gttttaatga ttttgctaca tttggaattc aataaaaaat gtgaacaata 312 0 

ataaaaaaaa aaaaaaaaaa 3140 

<210> 69 

<211> 3513 

<212> DNA 

<213> Homo sapiens 

<400> 69 

ccgtgtacca ggtgctgcta gtgggaagca cgctgctgaa ggaagtgcct tccgggctgc 60 

agctggagca gttgccttct cagagcctgc tgacccacat cccaacggcg gggctgccca 120 

cttcgctagg aggaggcctg ccttactgcc accaggcctg gctggatttc cgaaggcggc 180 

tggaagctct actacagaac tgccaggcag cttgtgccct gctccagggg gccatcgaaa 240 

gtgtgaaggc tgtgccccag cccatggagc ctggggaggt cggtcagctg ctacagcaga 3 00 

cagaggtcct gatgcagcag gtgctagact cgccatggct ggcatggcta caatgccagg 3 60 

ggggccggga gctgacatgg ctgaagcaag aggtcccaga ggtgaccctg agcccagact 420 

acaggacggc aatggacaag gctgacgagc tatatgaccg ggtggatgga ttgctgcacc 4 80 

aactgaccct gcagagcaac cagcgaatac aggccctaga gttggtccaa acactggagg 54 0 
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cccgggaaag cggactgcac cagattgaag tgtggctgca gcaggtgggc tggccagcac 600 

tggsggaggc tggggagccc tcgctggaca tgctgctcca ggcccaaggc tcttttcagg 660 

agctgtacca ggttgcccag gagcaggtca ggcaagggga gaagtttctg cagccgctga 72 0 

ctggctggga ggcggctgaa ctggaccccc ctggggcacg ctttctggcc ctgcgagccc 780 

agctgactga attctctagg gctttggccc agcggtgcca gcggctggcg gatgctgaga 840 

ggctgtttca gctcttcagg gaggccttga cgtgggctga ggaggggcag cgagtgttgg 900 

cagagctgga gcaggaacgc ccgggggttg tgttgcagca gctgcagctg cactggacca 960 

ggcaccctga cttgcctcct gcccacttcc gaaagatgtg ggctctggcc acggggctgg 1020 

gctcagaggc catccgccag gagtgccgct gggcctgggc gcggtgccag gacacctggc 1080 

tggccctgga ccaaaagctt gaggcttcac tgaagctacc accggtgggc agcacagcta 114 0 

gcctgtgtgt cagccaggtc cccgctgcac ctgcccaccc tcccctgagg aaggcctaca 1200 

gcttcgatcg gaatctgggg cagagtctca gtgaacctgc ctgccactgc caccatgcgg 12 60 

ccactattgc tgcctgccgc agaccagagg ctggaggagg tgccctgccc caggcatccc 132 0 

ctactgtgcc tccaccaggc agctctgacc ccaggagcct caacaggcta cagctggtgc 13 80 

tggcagagat ggtggccacg gagcgggagt atgtccgggc tctagagtac actatggaga 144 0 

actatttccc cgagctggat cgccccgatg tgccccaggg cctccgcggt cagcgtgccc 1500 

acctctttgg caacctggag aagctgcggg acttccactg ccacttcttc ctgcgtgagc 1560 

tggaggcttg cacccggcac ccaccacgag tggcctatgc cttcctgcgc catagggtgc 1620 

agtttgggat gtacgcgctc tacagcaaga ataagcctcg ctccgatgcc ctgatgtcaa 1680 

gctatgggca caccttcttc aaggacaagc agcaagcact gggggaccac ctggacctgg 1740 

cctcctacct gctaaagccc atccagcgca tgggcaagta cgcactgctg ctgcaggagc 1800 

tggcacgggc ctgcgggggc cccacgcagg agctcagtgc gctgcgggag gcccagagcc 1860 

ttgtgcactt ccagctgcgg cacggaaacg acctgctggc catggacgcc atccagggct 192 0 

gtgatgttaa cctcaaggaa caggggcagc tggtgcgaca ggatgagttt gtggtgcgca 1980 

ctgggcgcca caagtccgtg cgccgcatct tcctttttga ggagctgctg ctcttcagca 2 040 

agcctcgcca tgggcccaca ggggttgaca catttgccta caagcgctcc ttcaagatgg 2100 

cagaccttgg tctcactgag tgctgtggga acagcaacct gcgcttcgag atctggttcc 2160 

gccgccgcaa ggccagggac acctttgtgc tgcaggcctc cagcctggct atcaagcagg 2220 

cctggacagc tgacatctcc cacctgcttt ggaggcaggc cgtccacaac aaggaggtgc 2280 

gcatggctga gatggtgtcc atgggtgtgg ggaacaaggc cttccgagac attgctccca 2340 

gcgaggaagc catcaacgac cgcaccgtca actatgtcct gaagtgccga gaagttcgct 2400 

ctcgggcgtc cattgccgta gccccgtttg accatgacag cctctacctg ggggcctcga 2460 

actcccttcc tggagaccct gcctcttgct ctgttctggg gtccctcaac ctgcacctgt 252 0 

acagagaccc agctcttctg ggtctccgct gtcccctgta tcccagcttc ctagaggaag 2580 

cage act gga ggctgaggca gagctgggcg gccagccctc tttgactget gaggactcag 264 0 

agatctegtc ccaatgccca teagecagtg gctccagtgg ctctgacagc agctgtgtgt 270 0 

cagggcaggc cctgggtagg ggcctggagg acttaccctg tgtctgagee cgggactgga 2760 

cgagcagtag atccagcagc ctgcagctcc aaggaacatt gcctctctgg atctgctgtg 2 820 

accagggtgt ggctgacacc tgggctacct ccaacctaca tgtgcaacgc tgttgactac 2 880 

cctttctgat gtgtgtggcc attggactaa ctggcacggg gcctctctag ggaagtctgg 2 940 

ttgtagagcc tgaataggct cctggcccca tgaccccttc tcctgtcccc agctcccatc 3000 

ccagttgtgg gttaagaata ggctagagca gacattgggt gtttccatgc tgtaggctgg 3 060 

tgggggacca tgtgcctcta ggcagtgact agggtgcccc cacccctcag gaagaacaca 3120 

99tgggctcc tagcagctga tccccaatgc ctggccttaa agccgagctc agttaccata 318 0 

gggacaggtc cacctctact gggccctcat gettgecttt cctggccccc aggcccagcc 3240 

cctttttact ggggcagttt cgttattttg acttgatgee ttttgaataa ctttcaatag 3 3 00 

aattgtctaa aattatctta ctggttgtta ggcctttggt gtctcagaga aggagtctag 3360 

gtctttgatg tgtgatttaa tcttttattt gtttataata aaaaatagac tgatttgtaa 3420 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 34 80 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaa 3 513 

<210> 70 
<211> 3597 
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<212> DNA 

<213> Homo sapiens 

<400> 70 

catgccagtt acttcctcag gaaaatattt tcttgccttc ttctttcagt atggttttaa 60 

atttgggaac agtggataac ccaagtgtcc cacaggccaa ggtatattcc aatggcagca 12 0 

tgatccctgc acccaaagcc agcccctaaa gcctacccct tgtgcacccg cagcctggta 180 

agtgagcttg gctgcttgtg aggagctaca- agtgaaagag aagttatttt aaataaatcc 24 0 

caaagtttga ggcagactgt ccaggactgt tcccaggaag aagcaggagt tacccacagg 3 00 

aaaagtctct gacctggtcc cctcaggccc agctacctgc gcccaccagc agtgaaggtt 360 

gatgtactgg cccagcatct ccacctcccc catgcaacca ggtccctggt accgtgtctc 42 0 

ccgttgcatg tctggcttct gcctgtgctc . ctcctgccac gagcatcctc cctgtccctc 480 

ctcattccac cgtgtctctc ctgcacacat agcctctgtc ccagggcgat ttatccactt 540 

gagtacagga gctgctcaga cctctcagcc cagccctctg tgactgcccc agccccatcc 600 

taccccaccc aaagctgcct tcctggctgt aggagctccc tcgtctagcc aaggccctat 660 

gggtccccat ccgaggatcc acaagcaatg acttcccaaa tgacctccac tgcaagaaga 72 0 

atccttacca ctgtttccag agccgtgaac gatgctgtga tggcccaggt ctcagcacca 780 

ccctctgtga cctaaaaaga aaagctcaat ttccatctgt cttctttccc aggaccaagg 84 0 

ggacacagta atgtgaagtc aaatacttaa ccgagcaaag ggccagtgtt gttatcagtc . 900 

aaggacaaac ctcccacctc acagacagcc aagcagtgag ggaaagacag acagacatag 960 

gtaggaaggt gctctgcagg cacaaggccc agagaagccc ctctccggga acttcccctg 102 0 

ctccttccag gaacagtgag cccagtgagc agtcccagcc agctcttcaa ggccttcaag 1080 

gggtctttcc atgactgagt cacctccagg agctcacctg acccccagag aagacctacc 114 0 

ccaggcagct ccgtgccctg gcttctcccc atgccccaaa tcccccccag ccatccctcc 12 0 0 

tggtcctcgt ctacatcaag ggcctcttcc cctcttcctc ccagctctca ggacaggtga 12 60 

ctgggagacc ttgaaccctc agcctcttcc tttaaaaaaa acaaaacaaa acaaaactgt 132 0 

gggccattta tttgggattt tggagttgtt tggtttttgt ttttatatct taatagttcg 13 8 0 

aaagtaagaa gggagccctg ctatggatgt taagtccaaa ttactcggtt agtgggagca 1440 

aaacctatga cttccaaggg gatgaggaga ggttcagagg acaggaggag cctcccccat 15 00 

tgaaaaaaaa aaaatgggtc aggacattcc ctggatgagg acaatgctag gggtggcatc 1560 

tcacatggct gctgctattc ctggtgcttc cccacacttt tgacagatgg agtccttctc 1620 

ctaccgcctc ctgccacctc accctacagg cattctctat gtaggaaaca agagccttat 1680 

cttatagagt ggggagctga gacacagcct caggtaacac tgacacagct cccgaatgag 1740 

gctgggacac tctgcaaacc tctcctcatg gtgctaaggg tggcatgctc ttgacaggaa 1800 

acctaaatga ccactcctct catttggaaa gtaatccact gcagtaaaag tttcagacat 1860 

gcaagagaga gttttttttt ttttttacta caaatttttg ctcccccata aaattatttt 1920 

tttattagag ggagtatcca agttttaaaa gtatatagaa ttttttggtt gtaagagaaa 1980 

tacatactca ttaggatccc gattaaattc cttgagtaga ctggtgccta ccagaaagca 2 040 

aagcaaagtt aaacaaaacg aaacaaaatc cttcatatac aaaaagaact ttctgtttgt 2100 

attggcagag gtagtgaggt gattcaggta ggctgaaaat cctgggttgc gggagcctca 2160 

ctttattcca ttcccacccg ctttgatgtc tatgcfctggc tctctgggct gcccctggta 2220 

ctgccgaatc ctacacatct cttatcagct ttcctcaaac tttaaggagg ctctgtgagg 22 80 

gatgggtcat gggaagaccc aagctttccc tccgccagga ttgcaaaagc aagtagactt 2340 

ggtctatgca gctcttcttc caacaatttc tttatttgga attagaactt cctttgttag 2400 

tatctttgat cttttgactc aagcacattt tggaagggct cccttacaaa agtagaattt 2460 

aaaacagagg atacagttaa agagcaaccc aaaggacgct taagaaaccg agaccacttc 2520 

accgaacagg actaaggaac actttcgtgc acagaagtca gccgcaatcc aggcacagga 2580 

cgaagatggg atacacgtgc tcatctgtct gtcctccttt cctctccctc cccgacgttc 2 640 

tagttagctt gttgacttgt taaaccttct gttcttaaaa tgaaaagcta gcttacctca 2700 

aagaatcttg tttccattcg gaaaccaacg attttgtgtt ttagaatgga cagccctccc 2760 

ctcaccactc cctaccttgg cctggtgtcc ttgagacata cggtctttgc ttagtcgtgt 2820 

gttggctgct ttgagcagga acgaggcctc caggccctga ggtgggaagg aaggattgga 2 880 

tgccactgcc ctcctcccca ctttagcatg taggggccag cccatctctt ccagcagggt 2 94 0 
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cctgctgagt taccatagca accagcaact ccagggtacc acaacagaca atggctcagc 3000 

gagccgacgt gtggggatga tgcaggggtt ttggcccagc cagaggaccc agagttgagc 3 060 

ttcaaatgct agagaagggg agaaacagga tggaagggtg gtttaaggaa ctggcagggg 312 0 

tctttgagtc acatagagaa gccgttgaag gaggtagggc aggttatctc tgttccagtc 3180 

acccccttcc agccccatcc cacttctgtt tcaaactaaa gctcccacct cgaacattga 3240 

ccctttgtta gaacaaagca aagcatatct ttaaacaaca gtgttaaaat gagcctcaaa 33 00 

tgtatgtgga tgagatctct aagaagaggg tcttctggtt ttgattttta aagaagagta 3360 

tcctagtaaa atattaaaaa aaaattaaaa agtttttaaa aaggaaacct atgctattta 3420 

aattggagcc cagttgtaac ttggtaaagg caagcttctg tacctttgtt ataattaatt 34 80 
gtatacctgt gtatgtaaat ataaggcatt cctattttgc agttcagaac aaaaaaaact * 3 540 

tatttgtaat atagaataaa gtttattaaa aaataagaaa aaaaaaaaaa aaaaaaa 3 597 



<210> 71 
<211> 855 
<212> DNA 

<213> Homo sapiens 
<400> 71 

cgctcaatta tctactcgag tctagactcg aggcggccgc ccattgtgca ctaaagcagg 60 

ggatagcaac ggcgtccctc ctccccgctc agctgcagcc cgcagtcctc acagtggtaa 12 0 

catgccacgt ggtagtctct gtccatggac accacacgga tggttgtctc gcagccctgt 180 

gcagggagga taggacgggc acaggaggcg cattttggtg caaaaaccgt gtgatagtct 240 

cgcacgcagt agatgttgtt ctccacgtcc acggtgaagg gaaccccgtc caggcactca 3 00 

ttgcacacgg agcaccggaa gcagcctggg tggtaggact tgcccagggc ctgcaggatc 3 60 

atttccatga tgagatgtcc acacacgctg catttgtcgg ccgtctgctg gaacccggag 420 

tacaggaagt cctcctggca gtacactttc tcacccacgt tgtagaacgc cttcccacgg 480 

agtcgtctcc cacacgagtc gcaggtgaag cagtcagtgt gataaagact ccccattgcc 540 

tggcacgcct gctgggctcc gtagatgcca agcccacact tgatgcaaat gccgaagtag 600 

tcccgcgccg tgcgcgcctc gagcgcccgc tccagctccc gggtgagcgc ctccagccgc 660 

cgctcggccg cgcttgggcc gccctcccgg ccagggggca gcgggagtgc aggcagcggg 72 0 

aagggggccg gccccgcagg ctccggggag cgagcgggcg cggggcaggc gccgggcggg 780 

aggaagtcag cgtaggcttt agtgcacaat gggaattcgg atccggtcga cactagttct 84 0 

aggatctctt ttttt 8 55 

<210> 72 
<211> 3791 
<212> DNA 

<213> Homo sapiens 
<400> 72 

acagacggcg ggtgaacatg gcgtcctcga cttggtctga gacgtgatag gcctgccttc 60 

tggttgaaga tgtggcgagt gaaaaaactg agcctcagcc tgtcgccttc gccccagacg 120 

ggaaaaccat ctatgagaac tcctctccgt gaacttaccc tgcagcccgg tgccctcacc 180 

acctctggaa aaagatcccc cgcttgctcc tcgctgaccc catcactgtg caagctgggg 24 0 

ctgcaggaag gcagcaacaa ctcgtctcca gtggattttg taaataacaa gaggacagac 3 00 

ttatcttcag aacatttcag tcattcctca aagtggctag aaacttgtca gcatgaatca 3 60 

gatgagcagc ctctagatcc aattccccaa attagctcta ctcctaaaac gtctgaggaa 42 0 

gcagtagacc cactgggcaa ttatatggtt aaaaccatcg tccttgtacc atctccactg 4 80 

gggcagcaac aagacatgat atttgaggcc cgtttagata ccatggcaga gacaaacagc 54 0 

atatctttaa atggaccttt gagaacagac gatctggtga gagaggaggt ggcaccctgc 600 

atgggagaca ggttttcaga agttgctgct gtatctgaga aacctatctt tcaggaatct 660 

ccgtcccatc tcttagagga gtctccacca aatccctgtt ctgaacaact acattgctcc 72 0 

aaggaaagcc tgagcagtag aactgaggct gtgcgtgagg acttagtacc ttctgaaagt 780 
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aacgccttct tgccttcctc tgttctctgg ctttcccctt caactgcctt ggcagcagat 84 0 

ttccgtgtca atcatgtgga cccagaggag gaaattgtag agcatggagc tatggaggaa 900 

agagaaatga ggtttcccac acatcctaag gagtctgaaa cagaagatca agcacttgtc 960 

tcaagtgtgg aagatattct gtccacatgc ctgacaccaa atctagtaga aatggaatcc 102 0 

caagaagctc caggcccagc agtagaagat gttggtagga ttcttggctc tgatacagag 10 80 

tcttggatgt ccccactggc ctggctggaa aaaggtgtaa atacctccgt catgctggaa 114 0 

aatctccgcc aaagcttatc ccttccctcg atgcttcggg atgctgcaat tggcactacc 12 00 

cctttctcta cttgctcggt ggggacttgg tttactcctt cagcaccaca ggaaaagagt 12 60 

acaaacacat cccagacagg cctggttggc accaagcaca gtacttctga gacagagcag 132 0 

ctcctgtgtg gccggcctcc agatctgact gccttgtctc gacatgactt ggaagataac 13 80 

ctgctgagct ctcttgtcat tgtggagttt ctctcccgcc agcttcggga ctggaagagc 144 0 

cagctggctg tccctcaccc agaaacccag gacagtagca cacagactga cacatctcac 1500 

agtgggataa ctaataaact tcagcatctt aaggagagcc atgagatggg acaggcccta 1560 

cagcaggcca gaaatgtcat gcaatcatgg gtgcttatct ctaaagagct gatatccttg 1620 

cttcacctat ccctgttgca tttagaagaa gataagacta ctgtgaatca ggagtctcgg 16 8 0 

cgtgcagaaa cattggtctg ttgctgtttt gatttgctga agaaattgag ggcaaagctc 174 0 

cagagcctca aagcagaaag ggaggaggca aggcacagag aggaaatggc tctcagaggc 1800 

aaggatgcgg cagagatagt gttggaggct ttctgtgcac acgccagcca gcgcatcagc 1860 

cagctggaac aggacctagc atccatgcgg gaattcagag gccttctgaa ggatgcccag 192 0 

acccaactgg tagggcttca tgccaagcaa gaagagctgg ttcagcagac agtgagtctt 1980 

acttctacct tgcaacaaga ctggaggtcc atgcaactgg attatacaac atggacagct 2 04 0 

ttgctgagtc ggtcccgaca actcacagag aaactcacag tcaagagcca gcaagccctg 2100 

caggaacgtg atgtggcaat tgaggaaaag caggaggttt ctagggtgct ggaacaagtc 2160. 

tctgcccagt tagaggagtg caaaggccaa acagaacaac tggagttgga aaacattcgt 2220 

ctagcaacag atctccgggc tcagttgcag attctggcca acatggacag ccagctaaaa 2280 

gagctacaga gtcagcatac ccattgtgcc caggacctgg ctatgaagga tgagttactc 2340 

tgccagctta cccagagcaa tgaggagcag gctgctcaat gcgtaaagga agagatggca 24 0 0 

ctaaaacaca tgcaggcaga actgcagcag caacaagctg tcctggccaa agaggtgcgg . 2460 

gacctgaaag agaccttgga gtttgcagac caggagaatc aggttgctca cctggagctg 2520 

ggtcaggttg agtgtcaatt gaaaaccaca ctggaagtgc tccgggagcg cagcttgcag 2580 

tgtgagaacc tcaaggacac tgtagagaac ctaacggcta aactggccag caccatagca 264 0 

gataaccagg agcaagatct ggagaaaaca cggcagtact ctcaaaagct agggctgctg 2700 

actgagcaac tacagagcct gactctcttt ctacagacaa aactaaagga gaagactgaa 2760 

caagagaccc ttctgctgag tacagcctgt cctcccaccc aggaacaccc tctgcctaat 2 82 0 

gacaggacct tcctgggaag catcttgaca gcagtggcag atgaagagcc agaatcaact 2 880 

cctgtgccct tgcttggaag tgacaagagt gctttcaccc gagtagcatc aatggtttcc 294 0 

cttcagcccg cagagacccc aggcatggag gagagcctgg cagaaatgag tattatgact 3 000 

actgagcttc agagtctttg ttccctgcta caagagtcta aagaagaagc catcaggact 3060 

ctgcagcgaa aaatttgtga gctgcaagct aggctgcagg cccaggaaga acagcatcag 312 0 

gaagtccaga aggcaaaaga agcagacata gagaagctga accaggcctt gtgcttgcgc 3180 

tacaagaatg aaaaggagct ccaggaagtg atacagcaga atgagaagat cctagaacag 3240 

atagacaaga gtggcgagct cataagcctt agagaggagg tgacccacct tacccgctca 33 00 

cttcggcgtg cggagacaga gaccaaagtg ctccaggagg cctggcaggc cagctggact 3360 

ccaactgcca gcctatggcc accaattgga tccaggagaa agtgtggctc tctcaggagg 342 0 

tggacaaact gagagtgafcg ttcctggaga tgaaaaatga gaaggaaaac tcctgatcaa 34 8 0 

gttccagagc ccatagaaat atcctagagg agaaccttcg gcgctctgac aaggagttag 354 0 

aaaaactaga tgacattgtt cagcatattt ataagaccct gctctctatt ccagaggtgg 3600 

tgaggggatg caaagaacta cagggattgc tggaatttct gagctaagaa actgaaagcc 3 660 

agaatttgtt tcacctcttt trtacctgcaa taccccctta ccccaatacc aagaccaact 372 0 

ggcatagagc caactgagat aaatgctatt taaataaagt gtatttaatg aaaaaaaaaa 3 78 0 

aaaaaaaaaa a 3791 

<210> 73 
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<211> 1683 
<212> DNA 

<213> Homo sapiens 



<400> 73 

ctctgagtgt ccagtggtca gttgccccag gatggggacc acagccagag cagccttggt 60 

cttgacctat ttggctgttg cttctgctgc ctctgaggga ggcttcacgg ctacaggaca 120 

gaggcagctg aggccagagc actttcaaga agttggctac gcagctcccc cctccccacc 180 

cctatcccga agcctcccca tggatcaccc tgactcctct cagcatggcc ctccctttga 240 

gggacagagt caagtgcagc cccctccctc tcaggaggcc acccctctcc aacaggaaaa 3 00 

gctgctacct gcccaactcc ctgctgaaaa ggaagtgggt ccccctctcc ctcaggaagc 360 

tgtccccctc caaaaagagc tgccctctct ccagcacccc aatgaacaga aggaaggaac 420 

gccagctcca tttggggacc agagccatcc agaacctgag tcctggaatg cagcccagca 480 

ctgccaacag gaccggtccc aagggggctg gggccaccgg ctggatggct tcccccctgg 54 0 

gcggccttct ccagacaatc tgaaccaaat ctgccttcct aaccgtcagc atgtggtata 600 

tggtccctgg aacctaccac agtccagcta ctcccacctc actcgccagg gtgagaccct 660 

caatttcctg gagattggat attcccgctg ctgccactgc cgcagccaca caaaccgcct 720 

agagtgtgcc aaacttgtgt gggaggaagc aatgagccga ttctgtgagg ccgagttctc 760 

ggtcaagacc cgaccccact ggtgctgcac gcggcagggg gaggctcggt tctcctgctt 840 

ccaggaggaa gctccccagc cacactacca gctccgggcc tgccccagcc atcagcctga 900 

tatttcctcg ggtcttgagc tgcctttccc tcctggggtg cccacattgg acaatatcaa 960 

gaacatctgc cacctgaggc gcttccgctc tgtgccacgc aacctgccag ctactgaccc 1020 

cctacaaagg gagctgctgg cactgatcca gctggagagg gagttccagc gctgctgccg 10 80 

ccaggggaac aatcacacct gtacatggaa ggcctgggag gatacccttg acaaatactg 1140 

tgaccgggag tatgctgtga agacccacca ccacttgtgt tgccgccacc ctcccagccc 12 00 

tactcgggat gagtgctttg cccgtcgggc tccttacccc aactatgacc gggacatctt 12 60 

gaccattgac atcagtcgag tcacccccaa cctcatgggc cacctctgtg gaaaccaaag 1320 

agttctcacc aagcataaac atattcctgg gctgatccac aacatgactg cccgctgctg 13 80 

tgacctgcca tttccagaac aggcctgctg tgcagaggag gagaaattaa ccttcatcaa 1440 * 

tgatctgtgt ggtccccgac gtaacatctg gcgagaccct gccctctgct gttacctgag 15 00 

tcctggggat gaacaggtca actgcttcaa catcaattat ctgaggaacg tggctctagt 15 60 *■-. 

gtctggagac actgagaacg ccaagggcca gggggagcag ggctcaactg gaggaacaaa 1620 •* 

tatcagctcc acctctgagc ccaaggaaga atgagtcacc ccagagccct agagggtcag 1680 • 

atg 1683 



<210> 74 
<211> 1696 
<212> DNA 

<213> Homo sapiens 



<400> 74 

cacctaaaag 

gtagattcgg 

aaaagaacca 

tatgcctggg 

tccttgtgga 

agagacttta 

gttgctgctg 

catgcccttc 

gattccactg 

acttacatta 

tggaatggtg 

gtcacccgta 



ccaaaatggg 
gcaagtccac 
ttgaaaaatt 
tcttggataa 
aatttgagac 
tcaaaaacat 
gtgttggtga 
tggcttacac 
agccacccta 
agaaaattgg 
acaacatgct 
aggatggcaa 



aaaggaaaag 
cactactggc 
tgagaaggag 
actgaaagct 
cagcaagtac 
gattacaggg 
atttgaagct 
actgggtgtg 
cagccagaag 
ctacaacccc 
ggagccaagt 
tgccagtgga 



actcatatca 
catctgatct 
gctgctgaga 
gagcgtgaac 
tatgtgacta 
acatctcagg 
ggtatctcca 
aaacaactaa 
agatatgagg 
gacacagtag 
gctaacatgc 
accacgctgc 



acattgtcgt 
ataaatgcgg 
tgggaaaggg 
gtggtatcac 
tcattgatgc 
ctgactgtgc 
agaatgggca 
ttgtcggtgt 
aaattgttaa 
catttgtgcc 
cttggttcaa 
ttgaggctgt 



cattggacac 


60 


tggcatcgac 


120 


ctccttcaag 


180 


cattgatatc 


240 


cccaggacac 


300 


tgtcctgatt 


360 


gacccgagag 


420 


taacaaaatg 


480 


ggaagtcagc 


540 


aatttctggt 


600 


gggatggaaa 


660 


ggactgcatc 


720 
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ctaccaccaa ctcgtccaac tgacaagccc ttgcgcctgc ctctccagga tgtctacaaa 
attggtggta ttggtactgt tcctgttggc cgagtggaga ctggtgttct caaacccggt 
atggtggtca cctttgctcc agtcaacgtt acaacggaag taaaatctgt cgaaatgcac 
catgaagctt tgagtgaagc tcttcctggg gacaatgtgg gcttcaatgt caagaatgtg 
tctgtcaagg atgttcgtcg tggcaacgtt gctggtgaca gcaaaaatga cccaccaatg 
gaagcagctg gcttcactgc tcaggtgatt atcctgaacc atccaggcca aataagcgcc 
ggctatgccc ctgtattgga ttgccacacg gctcacattg catgcaagtt tgctgagctg 
aaggaaaaga ttgatcgccg ttctggtaaa aagctggaag atggccctaa attcttgaag 
tctggtgatg ctgccattgt tgatatggtt cctggcaagc ccatgtgtgt tgagagcttc 
tcagactatc cacctttggg tcgctttgct gttcgtgata tgagacagac agttgcggtg 
ggtgtcatca aagcagtgga caagaaggct gctggagctg gcaaggtcac caagtctgcc 
cagaaagctc agaaggctaa atgaatatta tccctaatac ctgccacccc actcttaatc 
agtggtggaa gaacggtctc agaactgttt gtttcaattg gccatttaag tttagtagta 
aaagactggt taatgataac aatgcatcgt aaaaccttca gaaggaaagg agaatgtttt 
gtggaccact ttggttttct tttttgcgtg tggcagtttt aagttattag tttttaaaat 
cagtactttt taatggaaac aacttgacca aaaatttgtc acagaatttt gagacccatt 
aaaaaagtta aatgag 

<210> 75 

<211> 7680 

<212> DNA 

<213> Homo sapiens 



gaagagcaag aggcaggctc agcaaatggt tcagccccag tccccggtgg ctgtcagtca 
aagcaagccc ggttgttatg acaatggaaa acactatcag ataaatcaac agtgggagcg 
gacctaccta ggtaatgtgt tggtttgtac ttgttatgga ggaagccgag gttttaactg 
cgaaagtaaa cctgaagctg aagagacttg ctttgacaag tacactggga acacttaccg 
agtgggtgac acttatgagc gtcctaaaga ctccatgatc tgggactgta cctgcatcgg 
ggctgggcga gggagaataa gctgtaccat cgcaaaccgc tgccatgaag ggggtcagtc 
ctacaagatt ggtgacacct ggaggagacc acatgagact ggtggttaca tgttagagtg 
tgtgtgtctt ggtaatggaa aaggagaatg gacctgcaag cccatagctg agaagtgttt 
tgatcatgct gctgggactt cctatgtggt cggagaaacg tgggagaagc cctaccaagg 
ctggatgatg gtagattgta cttgcctggg agaaggcagc ggacgcatca cttgcacttc 
tagaaataga tgcaacgatc aggacacaag gacatcctat agaattggag acacctggag 
caagaaggat aatcgaggaa acctgctcca gtgcatctgc acaggcaacg gccgaggaga 
gtggaagtgt gagaggcaca cctctgtgca gaccacatcg agcggatctg gccccttcac 
cgatgttcgt gcagctgttt accaaccgca gcctcacccc cagcctcctc cctatggcca 
ctgtgtcaca gacagtggtg tggtctactc tgtggggatg cagtggttga agacacaagg 
aaataagcaa atgctttgca cgtgcctggg caacggagtc agctgccaag agacagctgt 
aacccagact tacggtggca acttaaatgg agagccatgt gtcttaccat tcacctacaa 
tggcaggacg ttctactcct gcaccacgga agggcgacag gacggacatc tttggtgcag 
cacaacttcg aattatgagc aggaccagaa atactctttc tgcacagacc acactgtttt 
ggttcagact caaggaggaa attccaatgg tgccttgtgc cacttcccct tcctatacaa 
caaccacaat tacactgatt gcacttctga gggcagaaga gacaacatga agtggtgtgg 
gaccacacag aactatgatg ccgaccagaa gtttgggttc tgccccatgg ctgcccacga 
ggaaatctgc acaaccaatg aaggggtcat gtaccgcatt ggagatcagt gggataagca 
gcatgacatg ggtcacatga tgaggtgcac gtgtgttggg aatggtcgtg gggaatggac 
atgcattgcc tactcgcaac ttcgagatca gtgcattgtt gatgacatca cttacaatgt 
gaacgacaca ttccacaagc gtcatgaaga ggggcacatg ctgaactgta catgcttcgg 
tcagggtcgg ggcaggtgga agtgtgatcc cgtcgaccaa tgccaggatt cagagactgg 
gacgttttat caaattggag attcatggga gaagtatgtg catggtgtca gataccagtg 
ctactgctat ggccgtggca ttggggagtg gcattgccaa cctttacaga cctatccaag 
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ctcaagtggt cctgtcgaag tatttatcac tgagactccg agtcagccca actcccaccc 
catccagtgg aatgcaccac agccatctca catttccaag tacattctca ggtggagacc 
taaaaattct gtaggccgtt ggaaggaagc taccatacca ggccacttaa actcctacac 
catcaaaggc ctgaagcctg gtgtggtata cgagggccag ctcatcagca tccagcagta 
cggccaccaa gaagtgactc gctttgactt caccaccacc agcaccagca cacctgtgac 
cagcaacacc gtgacaggag agacgactcc cttttctcct cttgtggcca cttctgaatc 
tgtgaccgaa atcacagcca gtagctttgt ggtctcctgg gtctcagctt ccgacaccgt 
gtcgggattc cgggtggaat atgagctgag tgaggaggga gatgagccac agtacctgga 
tcttccaagc acagccactt ctgtgaacat ccctgacctg cttcctggcc gaaaatacat 
tgtaaatgtc tatcagatat ctgaggatgg ggagcagagt ttgatcctgt ctacttcaca 
aacaacagcg cctgatgccc ctcctgaccc gactgtggac caagttgatg acacctcaat 
tgttgttcgc tggagcagac cccaggctcc catcacaggg tacagaatag tctattcgcc 
atcagtagaa ggtagcagca cagaactcaa ccttcctgaa actgcaaact ccgtcaccct 
cagtgacttg caacctggtg ttcagtataa catcactatc tatgctgtgg aagaaaatca 
agaaagtaca cctgttgtca ttcaacaaga aaccactggc accccacgct cagatacagt 
gccctctccc agggacctgc agtttgtgga agtgacagac gtgaaggtca ccatcatgtg 
gacaccgcct gagagtgcag tgaccggcta ccgtgtggat gtgatccccg tcaacctgcc 
tggcgagcac gggcagaggc tgcccatcag caggaacacc tttgcagaag tcaccgggct 
gtcccctggg gtcacctatt acttcaaagt ctttgcagtg agccatggga gggagagcaa 
gcctctgact gctcaacaga caaccaaact ggatgctccc actaacctcc agtttgtcaa 
tgaaactgat tctactgtcc tggtgagatg gactccacct cgggcccaga taacaggata 
ccgactgacc gtgggcctta cccgaagagg ccagcccagg cagtacaatg tgggtccctc 
tgtctccaag taccccctga ggaatctgca gcctgcatct gagtacaccg tatccctcgt 
ggccataaag ggcaaccaag agagccccaa agccactgga gtctttacca cactgcagcc 
tgggagctct attccacctt acaacaccga ggtgactgag accaccatcg tgatcacatg 
gacgcctgct ccaagaattg gttttaagct gggtgtacga ccaagccagg gaggagaggc 
accacgagaa gtgacttcag actcaggaag catcgttgtg tccggcttga ctccaggagt 
agaatacgtc tacaccatcc aagtcctgag* agatggacag gaaagagatg cgccaattgt 
aaacaaagtg gtgacaccat tgtctccacc aacaaacttg catctggagg caaaccctga 
cactggagtg ctcacagtct cctgggagag gagcaccacc ccagacatta ctggttatag 
aattaccaca acccctacaa acggccagca gggaaattct ttggaagaag tggtccatgc 
tgatcagagc tcctgcactt ttgataacct gagtcccggc ctggagtaca atgtcagtgt 
ttacactgtc aaggatgaca aggaaagtgt ccctatctct gataccatca tcccagctgt 
tcctcctccc actgacctgc gattcaccaa cattggtcca gacaccatgc gtgtcacctg 
ggctccaccc ccatccattg atttaaccaa cttcctggtg cgttactcac ctgtgaaaaa 
tgaggaagat gttgcagagt tgtcaatttc tccttcagac aatgcagtgg tcttaacaaa 
tctcctgcct ggtacagaat atgtagtgag tgtctccagt gtctacgaac aacatgagag 
cacacctctt agaggaagac agaaaacagg tcttgattcc ccaactggca ttgacttttc 
tgatattact gccaactctt ttactgtgca ctggattgct cctcgagcca ccatcactgg 
ctacaggatc cgccatcatc ccgagcactt cagtgggaga cctcgagaag atcgggtgcc 
ccactctcgg aattccatca ccctcaccaa cctcactcca ggcacagagt atgtggtcag 
catcgttgct cttaatggca gagaggaaag tcccttattg attggccaac aatcaacagt 
ttctgatgtt ccgagggacc tggaagttgt tgctgcgacc cccaccagcc tactgatcag 
ctgggatgct cctgctgtca cagtgagata ttacaggatc acttacggag aaacaggagg 
aaatagccct gtccaggagt tcactgtgcc tgggagcaag tctacagcta ccatcagcgg 
ccttaaacct ggagttgatt ataccatcac tgtgtatgct gtcactggcc gtggagacag 
ccccgcaagc agcaagccaa tttccattaa ttaccgaaca gaaattgaca aaccatccca 
gatgcaagtg accgatgttc aggacaacag cattagtgtc aagtggctgc cttcaagttc 
ccctgttact ggttacagag taaccaccac tcccaaaaat ggaccaggac caacaaaaac 
taaaactgca ggtccagatc aaacagaaat gactattgaa ggcttgcagc ccacagtgga 
gtatgtggtt agtgtctatg ctcagaatcc aagcggagag agtcagcctc tggttcagac 
tgcagtaacc aacattgatc gccctaaagg actggcattc actgatgtgg atgtcgattc 
catcaaaatt gcttgggaaa gcccacaggg gcaagtttcc aggtacaggg tgacctactc 
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gagccctgag gatggaatcc atgagctatt ccctgcacct gatggtgaag aagacactgc 
agagctgcaa ggcctcagac cgggttctga gtacacagtc agtgtggttg ccttgcacga 
tgatatggag agccagcccc tgattggaac ccagtccaca gctattcctg caccaactga 
cctgaagttc actcaggtca cacccacaag cctgagcgcc cagtggacac cacccaatgt 
tcagctcact ggatatcgag tgcgggtgac ccccaaggag aagaccggac caatgaaaga 
aatcaacctt gctcctgaca gctcatccgt ggttgtatca ggacttatgg tggccaccaa 
atatgaagtg agtgtctatg ctcttaagga cactttgaca agcagaccag ctcagggtgt 
tgtcaccact ctggagaatg tcagcccacc aagaagggct cgtgtgacag atgctactga 
gaccaccatc accattagct ggagaaccaa gactgagacg atcactggct tccaagttga 
tgccgttcca gccaatggcc agactccaat ccagagaacc atcaagccag atgtcagaag 
ctacaccatc acaggtttac aaccaggcac tgactacaag atctacctgt acaccttgaa 
tgacaatgct cggagctccc ctgtggtcat cgacgcctcc actgccattg atgcaccatc 
caacctgcgt ttcctggcca ccacacccaa ttccttgctg gtatcatggc agccgccacg 
tgccaggatt accggctaca tcatcaagta tgagaagcct gggtctcctc ccagagaag't 
ggtccctcgg ccccgccctg gtgtcacaga ggctactatt actggcctgg aaccgggaac 
cgaatataca atttatgtca ttgccctgaa gaataatcag aagagcgagc ccctgattgg 
aaggaaaaag acagacgagc ttccccaact ggtaaccctt ccacacccca atcttcatgg 
accagagatc ttggatgttc cttccacagt tcaaaagacc cctttcgtca cccaccctgg 
gtatgacact ggaaatggta ttcagcttcc tggcacttct ggtcagcaac ccagtgttgg 
gcaacaaatg atctttgagg aacatggttt taggcggacc acaccgccca caacggccac 
ccccataagg cataggccaa gaccataccc gccgaatgta ggacaagaag ctctctctca 
gacaaccatc tcatgggccc cattccagga cacttctgag tacatcattt catgtcatcc 
tgttggcact gatgaagaac ccttacagtt cagggttcct ggaacttcta ccagtgccac 
tctgacaggc ctcaccagag gtgccaccta caacatcata gtggaggcac tgaaagacca 
gcagaggcat aaggttcggg aagaggttgt taccgtgggc aactctgtca acgaaggctt 
gaaccaacct acggatgact cgtgctttga cccctacaca gtttcccatt atgccgttgg 
agatgagtgg gaacgaatgt ctgaatcagg ctttaaactg ttgtgccagt gcttaggctt 
tggaagtggt catttcagat gtgattcatc tagatggtgc catgacaatg gtgtgaacta 
caagattgga gagaagtggg accgtcaggg agaaaatggc cagatgatga gctgcacatg 
tcttgggaac ggaaaaggag aattcaagtg tgaccctcat gaggcaacgt gttacgatga 
tgggaagaca taccacgtag gagaacagtg gcagaaggaa tatctcggtg ccatttgctc 
ctgcacatgc tttggaggcc agcggggctg gcgctgtgac aactgccgca gacctggggg 
tgaacccagt cccgaaggca ctactggcca gtcctacaac cagtattctc agagatacca 
tcagagaaca aacactaatg ttaattgccc aattgagtgc ttcatgcctt tagatgtaca 
ggctgacaga gaagattccc gagagtaaat catctttcca atccagagga acaagcatgt 
ctctctgcca agatccatct aaactggagt gatgttagca gacccagctt agagttcttc 
tttctttctt aagccctttg ctctggagga agttctccag cttcagctca actcacagct 
tctccaagca tcaccctggg agtttcctga gggttttctc ataaatgagg gctgcacatt 
gcctgttctg cttcgaagta ttcaataccg ctcagtattt taaatgaagt gattctaaga 
tttggtttgg gatcaatagg aaagcatatg cagccaacca agatgcaaat gttttgaaat 
gatatgacca aaattttaag taggaaagtc acccaaacac ttctgctttc acttaagtgt 
ctggcccgca atactgtagg aacaagcatg atcttgttac tgtgatattt taaatatcca 
cagtactcac tttttccaaa tgatcctagt aattgcctag aaatatcttt ctcttacctg 
ttatttatca atttttccca gtatttttat acggaaaaaa ttgtattgaa aacacttagt 
atgcagttga taagaggaat ttggtataat tatggtgggt gattattttt tatactgtat 
gtgccaaagc tttactactg tggaaagaca actgttttaa taaaagattt acattccaca 



<210> 76 

<211> 1316 

<212> DNA 

<213> Homo sapiens 
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<400> 76 

tcctaatacg actcactata gggctcgagc ggccgcccgg gcaggtcgaa tgcaggcgac 60 

ttgcgagctg ggagcgattt aaaacgcttt ggattccccc ggcctgggtg gggagagcga 120 

gctgggtgcc ccctagattc cccgcccccg cacctcatga gccgaccctc ggctccatgg 180 

agcccggcaa ttatgccacc ttggatggag ccaaggatat cgaaggcttg ctgggagcgg 24 0 

gaggggggcg gaatctggtc gcccactccc ctctgaccag ccacccagcg gcgcctacgc 30 0 

tgatgcctgc tgtcaactat gcccccttgg atctgccagg ctcggcggag ccgccaaagc 360 

aatgccaccc atgccctggg gtgccccagg ggacgtcccc agctcccgtg ccttatggtt 420 

actttggagg cgggtactac tcctgccgag tgtcccggag ctcgctgaaa ccctgtgccc 480 

aggcagccac cctggccgcg taccccgcgg agactcccac ggccggggaa gagtacccca 54 0 

gtcgccccac tgagtttgcc ttctatccgg gatatccggg aacctaccac gctatggcca 600 

gttacctgga cgtgtctgtg gtgcagactc tgggtgctcc tggagaaccg cgacatgact 660 

ccctgttgcc tgtggacagt taccagtctt gggctctcgc tggtggctgg aacagccaga 72 0 

tgtgttgcca gggagaacag aacccaccag gtcccttttg gaaggcagca tttgcagact 780 

ccagcgggca gcaccctcct gacgcctgcg cctttcgtcg cggccgcaag aaacgcattc 840 

cgtacagcaa ggggcagttg cgggagctgg agcgggagta tgcggctaac aagttcatca 900 

ccaaggacaa gaggcgcaag atctcggcag ccaccagcct ctcggagcgc cagattacca 960 

tctggtttca gaaccgccgg. gtcaaagaga agaaggttct cgccaaggtg aagaacagcg 102 0 

ctacccctta agagatctcc ttgcctgggt gggaggagcg aaagtggggg tgtcctgggg 1080 

agaccagaaa cctgccaagc ccaggctggg gccaaggact ctgctgagag gcccctagag 1140 

acaacaccct tcccaggcca ctggctgctg gactgttcct caggagcggc ctgggtaccc 12 00 

agtatgtgca gggagacgga accccatgtg acaggcccac tccaccaggg ttcccaaaga 1260 

acctggccca gtcataatca ttcatcctca cagtggcaat aatcacgata accagt 1316 



<210> 77 

<211> 566 

<212> DNA 

<213> Homo sapiens 



<400> 77 

cccaccaaac ccataaagag ggtgggtcga cccacgcgtc cgcggacgcg tgggaaatta 60 

ttgaattgga aacagaaata gaaaagttta aagctgagaa cgcatcttta gctaaacttc 12 0 

gcattgaacg agaaagtgcc ttggaaaaac tcaggaaaga aattgcagac ttcgaacaac 18 0 

agaaagcaaa agaattagct cgaatagaag agtttaaaaa ggaggagatg aggaagctac 24 0 

aaaaggaacg taaagttttt gaaaagtata ctacagctgc aagaactttt ccagataaaa 300 

aggaacgtga agaaatacag actttaaaac agcaaatagc agatttacgg gaagatttga 360 

aaagaaagga gaccaaatgg tcaagtacac acagccgtct cagaagccag atacaaatgt 420 

tagtcagaga gaacacagac ctccgggaag aaataaaagt gatggaaaga ttccgactgg 4 80 

atgcctggaa' gagagcagaa gccatagaga gcagcctcga ggtggagaag aaggacaagc 540 

ttgcgaacac atctgttcga tttcaa 566 

<210> 78 
<211> 5067 
<212> DNA 

<213> Homo sapiens 



<400> 7B 

gcccggacac ctgtctgcag catggataag tatgacgacc tgggcctgga ggccagtaaa 60 

ttcatcgagg acctgaacat gtatgaggcc tctaaggatg ggctcttccg agtggacaag 120 

ggtgcaggca acaaccccga gtttgaggaa actcgcaggg tgttcgccac caagatggcc 180 

aaaatccacc tccagcagca gcagcagcag ctcctgcagg aggagactct gcccaggggg 24 0 

agtagaggcc ctgtcaatgg agggggccgc ctgggcccac aggcccgttg ggaagttgtg 300 

ggcagcaagc tgactgtgga tggtgctgcc aagcctcctc ttgctgcctc gacaggggca 360 
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cctggggcag tcaccaccct cgctgctggg cagcccccgt acccaccgca ggagcagaga 
tccaggccat acctgcatgg cacgaggcat ggcagccagg actgtggttc cagggagagc 
ctggcgactt ctgagatgtc tgctttccac cagccaggcc cctgtgagga tccttcctgc 
ctcactcatg gagactatta tgacaacctc tccttggcaa gcccaaagtg gggtgacaaa 
ccaggagtgt cccccagcat cggcctgagt gtagggagtg ggtggcctag ctccccgggg 
agtgacccac cactgcccaa accctgcggg gaccatcccc taaatcaccg acagctctcc 
ctgagctcca gcaggtcttc tgagggtagc ctcggtggtc agaatagtgg cattggtggc 
cgcagcagcg agaagccaac aggcctttgg tccactgcct cctcccagcg ggtgagccct 
ggcctgcctt ccccaaactt ggagaacgga gcaccagctg tggggcctgt tcagcccagg 
actccttctg tgtcagcacc cttggccctg agctgcccca ggcaaggagg tcttccaaga 
tcaaactcgg ggctgggggg tgaggtttca ggtgtgatgt ccaaacccaa tgtggacccc 
caaccctggt tccaggatgg gcccaaatct tacctttcca gttctgcccc gtcatcctcg 
ccagctggcc tggacggttc acagcagggt gcggtccctg ggctggggcc gaagcctggc 
tgcacagacc ttggcactgg tcccaagctc agccccacca gtcttgtcca tccagtgatg 
tccaccctgc ctgagttatc ttgtaaagag ggtcccctgg gctggtcttc tgatggtagc 
ctgggatctg tgctcctgga cagccccagc tcccctaggg taaggctgcc ctgccagccc 
ctcgtcccag gtcctgagct gagaccctct gctgctgagt tgaaattaga agccctcacc 
caacgtctgg agcgagagat ggatgctcac ccgaaggctg attactttgg agcctgtgtg 
aaatgcagca aaggggtgtt tggggctggc caggcctgtc aggccatggg gaacctctac 
catgacacat gcttcacctg tgcagcttgc agccggaagc tgagaggaaa agccttttat 
tttgtcaacg gcaaagtgtt ttgtgaagaa gacttcctgt actctggttt ccagcagtcg 
gctgacaggt gttttctttg tggacatctg atcatggaca tgatcctgca agccctgggg 
aagtcctacc accccggctg tttccgctgt gtcatctgta atgagtgttt ggatggggtg 
cccttcaccg tggactcaga gaacaagatc tactgtgtcc gagattacca caaggtgctg 
gcccccaagt gtgcagcctg tgggcttccc atccttccac ctgagggctc agatgagacc 
atccgtgtcg tgtccatgga cagagactac cacgtggagt gttaccactg cgaggactgt 
ggtctggagc tcaatgatga agatggccac cgctgttatc cgctggagga ccacctgttc 
tgtcactcct gccacgtgaa gaggctggag aagagaccct catctacagc ccttcaccag 
caccacttct agccagagcc acttgcagac atcacggcag gggatgagga gccggggttg 
ctgctgctgc ttccggtggc ccctggggtg gaagtggggt aggggaagag gaggggcagg 
agggagagtt cctgtgagca tgtggggggt gcctttcctt taaccaggga ggtgaacact 
acctgcctcc tgcgtgtatt ttccaagtgc ttttctctgt tgccacattt tcctcaggtt 
actcaggaaa atgctccagc atgtgcgagc acatgacctg aggttgcatc atagcaccaa 
aggaatcctc ctgtcccctc tgggaacatt tcatgcttca gagggagagg tttttattga 
gcttgtttca caatatcccc ttgaagggac agctcagctg ccaatacatt caaccctttc 
tcttccttca ggaaaatacc tatacccaaa tgttccctcc cccgacatat atcatggcat 
gacttaaggc ttcttttcac ctgagagctt cagttcttct gcagaatggc tgcaaattta 
attgcattaa ggcaagaagg aagctctaat gtgtgctttg tatcctaaga taaatttgct 
tagaaaacca gagtcaagat ttgaaatagg tgaggcaggg tttcctcctt agacactgac 
agcattctcc gtaccccttc aaatccttac tctcctaaag gcagctgagt ccgcgacaga 
aatttgccct atgggagtaa aacatacttt gggagaagaa cttggtgcag gcaccaggat 
tttttttttt gcccacgtgt ttgcgctgtt tttctctgga gttctcaaga gttggtgact 
tggaaggccg cttctgcaag gcaagtctca ggaacccatg caggtacatc gcttgcacct 
gtttttagct tatttaatga cgggcttttg ggaagagctg cccgcatact gagagacagc 
ttcttataaa caaggagagt ttttgtgtgt gcgagatctc taagccagcg tgggagggag 
cgcctcagga taagttatta tattcatttc gttggtttct ctcctgccca attcttggca 
caggcattat gtttgaagaa accaggataa ggtacactgc ttttgtctgt ttaatttttt 
tagttgtttc ccttcacttt cagtcttcca cacacaaaaa atacctcaca gagcttcacc 
aaatcacaga ttcaggagga atttggcttt cacactggac tcagatacct tcttcagtgt 
gttggaaatc actggcttca cacaggccca actccaactg gtcagggcag agtgatcgta 
actaaaggtc agtggggaat agatccgatt cagtgctttt gccttatgca tttcagcatc 
ctggctcccc agggtggcag gagctgagga agggccacac actggcaaga tttcaagacc 
actctctgca ctgaagaggt aaaatttgca ctgcaagtca catccctgag gccagaggtc 
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agtacccttt ggtatttcga ttagaagaag ctgcaaaaga aaggcagccc attttaccat 
tgccagccag gccggggaca caggagccgg tgtgtgcact- ctgcctccta acattgcacc 
cagagcaaga ggactgggtg ctgggctgca gaggccggtc agtggagccc ctagcacgtg 
tgaactcagg cttttcattg ggcccggctc cacttctagg ccatgttttg actcatttgg 
taaccattgc ctgtaagcag cacagaattg gtgccatgga ttatcttttc catgttgatg 
gaattcattc tgttggaatc ctttggccag atgtcacttc agccagggtg tgcatcatca 
ttggttcttt ttcacaggct gagcctcctg aaaacccatg aacgctgggg ctggggaagt 
gaaccctgag gtggggaccc tctcttccca tcaaatcatc cagctcagtg tggggcgtgg 
caggggggta aatgaagcca gccaatgtgt taacctgtct ctgtcaacct aagaatgttg 
gccttactga cacacctttg ctccatgttc aagaccagaa gtagctggga tttgtttgca 
aattgggtaa ttagtttaaa aatctgtgat tacattttta aatgaaattt tcaaagtggc 
ctagattgag gtgattcaga taggtttgcg aatataccat tttatattgt tgagaaagaa 
caaaaaggga atttccagat gtcctagaaa tcctagcaac agatttctct ggttgtcagt 
ttccctggag aaggcgccag ataggaatct ccaatcagtt gtttttctct tcgcttcagg 
cccttacaca aaagccatga agagatgttc acctacccgg tattttaaat gttctgtaaa 
ttattagcca aatagaactg taatggggtt gtatfctatgg gcgcctagaa agaaaacaca 
aggacttggt aggccaggaa gaaaagattt taaaatttag aatgaatagc ccttctgggt 
tttctttttg acaattcttg gacttgaggt aaaacaagga ggattgtggc cggatttcag 
atcccaaagc cagcctccat cttaggcctt tgcctcattg tgccttttag gttttcttac 
ccaccgtctc ctgttttgtc ttttttttct tttctcctac ccctatcttg ggacattcag 
aaactgcctg ggtggtttga gaagagacaa cccagtttga tctgcaatac aaggatccat 
tcgtaatctc tctctcactg atgttattcc cccatctgcc gtcttggttc atctcaccac 
agaagggcat ttagtcctac ccagccatcg gctgcggtat gacagcagga tggcacttcc 
catttctctg tggttagtgc tcgagtgaaa acctctttca gctgagtcct ctgaggttct 
gctgttgagt cctgggtggc tgatggaatg attgaggagg tctggtcacc ctcaagcgcc 
gtcatcgcct tgtttccatg ggcttct 

<210> 79 

<211> 950 

<212> DNA 

<213> Homo sapiens 



tcgaccggat ccgaattccc attgtgcact aaagcgtctc cctgctccgc ggcccgggct 
Srgcgggcggg cgctcggctg gcggctgcag cagcagaggg agacccgcgg caaccccggc 
aacccagggc tcggcgtcgc tgccaccatg acgggaagca atatgtcgga cgccttggcc 
aacgccgtgt gccagcgctg ccaggcccgc ttctcccccg ccgagcgcat tgtcaacagc 
aatggggagc tgtaccatga gcactgcttc gtgtgtgccc agtgcttccg gcccttcccc 
gaggggctct tctatgagtt tgaaggccgg aagtactgcg aacacgactt ccaaatgctg 
tttgctccgt gctgtggatc ctgcggtgag ttcatcattg gccgcgtcat caaggccatg 
aacaacaact ggcacccggg ctgcttccgc tgcgagctgt gtgatgtgga gctggctgac 
ctgggctttg tgaagaatgc cggcaggcat ctctgccggc cttgccacaa ccgtgagaag 
gccaagggcc tgggcaagta catctgccag cggtgccacc tggtcatcga cgagcagccc 
ctcatgttca ggagcgacgc ctaccaccct gaccacttca actgcaccca ctgtgggaag 
gagctgacag ccgaggcccg cgagctgaag ggtgagctct actgcctgcc ctgccatgac 
aagatgggcg ttcccatctg cggggcctgc cgccggccca tcgagggccg agtggtcaac 
gcgctgggca agcagtggca cgtggagcac tttgtctgtg ccaagtgtga gaagccattc 
ctggggcacc ggcactatga gaagaagggc ctggcctact gcgagcttta gtgcacaatg 
ggcggccgcc tcgagtctag actcgagtag ataattgagc ggaatttctt 

<210> 80 
<211> 2346 
<212> DNA 
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<213> Homo sapiens 
<400> 80 

ccgccgtcgc ccccgcctcc ccctgcctca gcggctgccc ccgccagcgg gccgcccgct 
cccccgggcc ttgcagcggg ccccggcccg gctggagggg ccccgacccc agctctggtg 
gcgggcagca gcgccgcggc ccccttccct cacggggact cggccctgaa cgagcaggag 
aaggagttgc agcggcggct gaagcgtctc tacccggccg tggacgaaca agagacgccg 
ctgcctcggt cctggagccc gaaggacaag ttcagctaca tcggcctctc tcagaacaac 
ctgcgggtgc actacaaagg tcatggcaaa accccaaaag atgccgcgtc agttcgagcc 
acgcatccaa taccagcagc ctgtgggatt tattattttg aagtaaaaat tgtcagtaag 
ggaagagatg gttacatggg aattggtctt tctgctcaag gtgtgaacat gaatagacta 
ccaggttggg ataagcattc atatggttac catggggatg atggacattc gttttgttct 
tctggaactg gacaacctta tggaccaact ttcactactg gtgatgtcat tggctgttgt 
gttaatctta tcaacaatac ctgcttttac accaagaatg gacatagttt aggtattgct 
ttcactgacc taccgccaaa tttgtatcct actgtggggc ttcaaacacc aggagaagtg 
gtcgatgcca attttgggca acatcctttc gtgtttgata tagaagacta tatgcgggag 
tggagaacca aaatccaggc acagatagat cgatttccta tcggagatcg agaaggagaa 
tggcagacca tgatacaaaa aatggtttca tcttatttag tccaccatgg gtactgtgcc 
acagcagagg cctttgccag atctacagac cagaccgttc tagaagaatt agcttccatt 
aagaatagac aaagaattca gaaattggta ttagcaggaa gaatgggaga agccattgaa 
acaacacaac agttafcaccc aagtttactt gaaagaaatc ctaatctcct tttcacatta 
aaagtgcgtc agtttataga aatggtgaat ggtacagata gtgaagtacg atgtttggga 
ggccgaagtc caaagtctca agacagttat cctgfctagtc ctcgaccttt tagtagtcca 
agtatgagcc ccagccatgg aatgaatatc cacaatttag catcaggcaa aggaagcacc 
gcacattttt caggttttga aagttgtagt aatggtgtaa tatcaaataa agcacatcaa 
tcatattgcc atagtaataa acaccagtca tccaacttga atgtaccaga actaaacagt 
ataaatatgt caagatcaca gcaagttaat aacttcacca gtaatgatgt agacatggaa 
acagatcact actccaatgg agttggagaa acttcatcca atggtttcct aaatggtagc 
tctaaacatg accacgaaat ggaagattgt gacaccgaaa tggaagttga ttcaagtcag 
ttgagacgcc agttgtgtgg aggaagtcag gccgccatag aaagaatgat ccactttgga 
cgagagctgc aagcaatgag tgaacagcta aggagagact gtggcaagaa cactgcaaac 
aaaaaaatgt tgaaggatgc attcagtcta ctagcatatt cagatccctg gaacagccca 
gttggaaatc agcttgaccc gattcagaga gaacctgtgt gctcagctct taacagtgca 
atattagaaa cccacaatct gccaaagcaa cctccacttg ccctagcaat gggacaggcc 
acacaatgtc taggactgat ggctcgatca ggaattggat cctgcgcatt tgccacagtg 
gaagactacc tacattagct atgcatttca agagctcaca cttatattgt ggcatatagt 
caacatggaa gtagaccagc tctgctgatt tgaaatttag attttttaaa ttatgtactg 
gggacaggtt tttgtcgctt tacattgctt cctagtttac agcatgatgc aaatgatttt 
ctaacttagt gttaggagaa attattttcc atctttaacc tcttagttgt ctaagagtta 
aatattactg aatttcagac gttcaaattg atcatcacaa atcctttaaa acaattacct 
aaaagaaacc aaaaatcctg ccttctttgt gggggagggg ggagagaggg gaaggaaatg 
gaacaagttg tgtttgtgtt agcatgtggg tgatgtaaac ttcaaattgg gagatgttcc 
gacccc 

<210> 81 

<211> 2512 

<212> DNA 

<213> Homo sapiens 

<400> 81 

caatgcactg acggatatga gtgggatcct gtgagacagc aatgcaaaga tattgatgaa 
tgtgacattg tcccagacgc ttgtaaaggt ggaatgaagt gtgtcaacca ctatggagga 
tacctctgcc ttccgaaaac agcccagatt attgtcaata atgaacagcc tcagcaggaa 
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acacaaccag cagaaggaac ctcaggggca accaccgggg ttgtagctgc cagcagcatg 

gcaaccagtg gagtgttgcc cgggggtggt tttgtggcca gtgctgctgc agtcgcaggc 

cctgaaatgc agactggccg aaataacttt gtcatccggc ggaacccagc tgaccctcag 

cgcattccct ccaacccttc ccaccgtatc cagtgtgcag caggctacga gcaaagtgaa 

cacaacgtgt gccaagacat agacgagtgc actgcaggga cgcacaactg tagagcagac 

caagtgtgca tcaatttacg gggatccttt gcatgtcagt gccctcctgg atatcagaag 

cgaggggagc agtgcgtaga catagatgaa tgtaccatcc ctccatattg ccaccaaaga 

tgcgtgaata caccaggctc attttattgc cagtgcagtc ctgggtttca attggcagca 

aacaactata cctgcgtaga tataaatgaa tgtgatgcca gcaatcaatg tgctcagcag 

tgctacaaca ttcttggttc attcatctgt cagtgcaatc aaggatatga gctaagcagt 

gacaggctca actgtgaaga cattgatgaa tgcagaacct caagctacct gtgtcaatat 

.caatgtgtca atgaacctgg gaaattctca tgtatgtgcc cccagggata ccaagtggtg 

agaagtagaa catgtcaaga tataaatgag tgtgagacca caaatgaatg ccgggaggat 

gaaatgtgtt ggaattatca tggcggcttc cgttgttatc cacgaaatcc ttgtcaagat 

ccctacattc taacaccaga gaaccgatgt gtttgcccag tctcaaatgc catgtgccga 

gaactgcccc agtcaatagt ctacaaatac atgagcatcc gatctgatag gtctgtgcca 

tcagacatct tccagataca ggccacaact atttatgcca acaccatcaa tacttttcgg 

attaaatctg gaaatgaaaa tggagagttc tacctacgac. aaacaagtcc tgtaagtgca 

atgcttgtgc tcgtgaagtc attatcagga ccaagagaac atatcgtgga cctggagatg 

ctgacagtca gcagtatagg gaccttccgc acaagctctg tgttaagatt gacaataata 

gtggggccat tttcatttta gtcttttcta agagtcaacc acaggcattt aagtcagcca 

aagaatattg ttaccttaaa gcactatttt atttatagat atatctagtg catctacatc 

tctatactgt acactcaccc ataacaaaca attacaccat ggtataaagt gggcatttaa 

tatgtaaaga ttcaaagttt gtctttatta ctatatgtaa attagacatt aatccactaa 

actggtcttc ttcaagagag ctaagtatac actatctggt gaaacttgga ttctttccta 

taaaagtggg accaagcaat gatgatcttc tgtggtgctt aaggaaactt actagagctc 

cactaacagt ctcataagga ggcagccatc ataaccattg aatagcatgc aagggtaaga 

atgagttttt aactgctttg taagaaaatg gaaaaggtca ataaagatat atttctttag 

aaaatgggga tctgccatat ttgtgttggt ttttattttc atatccagcc taaaggtggt 

tgtttattat atagtaataa atcattgctg tacaacatgc tggtttctgt agggtatttt 

taattttgtc agaaatttta gattgtgaat attttgtaaa aaacagtaag caaaattttc 

cagaattccc aaaatgaacc agataccccc tagaaaatta tactattgag aaatctatgg 

ggaggatatg agaaaataaa ttccttctaa accacattgg aactgacctg aagaagcaaa 

ctcggaaaat ataataacat ccctgaattc aggcattcac aagatgcaga acaaaatgga 

taaaaggtat ttcactggag aagttttaat ttctaagtaa aatttaaatc ctaacacttc 

actaatttat aactaaaatt tctcatcttc gtacttgatg ctcacagagg aagaaaatga 

tgatggtttt tattcctggc atccagagtg acagtgaact taagcaaatt accctcctac 

ccaattctat ggaatatttt atacgtctcc ttgtttaaaa tctgactgct ttactttgat 

gtatcatatt tttaaataaa aataaatatt cctttagaag atcactctaa aa 

<210> 82 

<211> 2306 

<212> DNA 

<213> Homo sapiens 



gggcgggagc tgcacgcgcc gtggctccgg atctcttcgt ctttgcagcg tacgcccgag 

tcggtcagcg ccggaggacc tcagcagcca tgtcgaagcc ccatagtgaa gccgggactg 

ccttcattca gacccagcag ctgcacgcag ccatggctga cacattcctg gagcacatgt 

gccgcctgga cattgattca ccacccatca cagcccggaa cactggcatc atctgtacca 

ttggcccagc ttcccgatca gtggagacgt tgaaggagat gattaagtct ggaatgaatg 

tggctcgtct gaacttctct catggaactc atgagtacca tgcggagacc atcaagaatg 

tgcgcacagc cacggaaagc tttgcttctg acccctacct ctaccggccc gttgctgtgg 
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ctctagacac taaaggacct gagatccgaa ctgggctcat caagggcagc ggcactgcag 480 

agctggagct gaagaaggga gccactctca aaatcacgct ggataacgcc tacatggaaa 54 0 

agtgtgacga gaacatcctg tggctggact acaagaacat ctgcaaggtg gtggaagtgg 60 0 

gcagcaagat ctacgtggat gatgggctta tttctctcca ggtgaagcag aaaggtgccg 660 

acttcctggt gacggaggtg gaaaatggtg gctccttggg cagcaagaag ggtgtgaacc 72 0 

ttcctggggc tgctgtggac ttgcctgctg tgtcggagaa ggacatccag gatctgaagt 78 0 

ttggggtcga gcaggatgtt gatatggtgt ttgcgtcatt catccgcaag gcatctgatg 84 0 

tccatgaagt taggaaggtc ctgggagaga agggaaagaa catcaagatt atcagcaaaa 90 0 

tcgagaatca tgagggggtt cggaggtttg atgaaatcct ggaggccagt gatgggatca 960 

tggtggctcg tggtgatcta ggcattgaga ttcctgcaga gaaggtcttc cttgctcaga 102 0 

agatgatgat tggacggtgc aaccgagctg ggaagcctgt catctgtgct actcagatgc 1080 

tggagagcat gatcaagaag ccccgcccca ctcgggctga aggcagtgat gtggccaatg 1140 

cagtcctgga tggagccgac tgcatcatgc tgtctggaga aacagccaaa ggggactatc 1200 

ctctggaggc tgtgcgcatg cagcacctga ttgcccgtga ggcagaggct gccatctacc 12 60 

acttgcaatt atttgaggaa ctccgccgcc tggcgcccat taccagcgac cccacagaag 132 0 

ccaccgccgt gggtgccgtg gaggcctcct tcaagtgctg cagtggggcc ataatcgtcc 13 80 

tcaccaagtc tggcaggtct gctcaccagg tggccagata ccgcccacgt gcccccatca 144 0 

ttgctgtgac ccggaatccc cagacagctc gtcaggccca cctgtaccgt ggcatcttcc 1500 

ctgtgctgtg caaggaccca gtccaggagg cctgggctga ggacgtggac ctccgggtga 1560 

actttgccat gaatgttggc aaggcccgag gcttcttcaa gaagggagat gtggtcattg 1620 

tgctgaccgg atggcgccct ggctccggct tcaccaacac catgcgtgtt gttcctgtgc 168 0 

cgtgatggac cccagagccc ctcctccagc ccctgtccca cccccttccc ccagcccatc 1740 

cattaggcca gcaacgcttg tagaactcac tctgggctgt aacgtggcac tggtaggttg 1800 

ggacaccagg gaagaagatc aacgcctcac tgaaacatgg ctgtgtttgc agcctgctct 1860 

agtgggacag cccagagcct ggctgcccat catgtggccc cacccaatca agggaagaag 192 0 

gaggaatgct ggactggagg cccctggagc cagatggcaa gagggtgaca gcttcctttc 1980 

ctgtgtgtac tctgtccagt tcctttagaa aaaatggatg cccagaggac tcccaaccct 204 0 

ggcttggggt caagaaacag ccagcaagag ttaggggcct tagggcactg ggctgttgtt 2100 

ccattgaagc cgactctggc cctggccctt acttgcttct ctagctctct aggcctctcc 2160 

agtttgcacc tgtccccacc ctccactcag ctgtcctgca gcaaacactc caccctccac 222 0 

cttccatttt cccccactac tgcagcacct ccaggcctgt tgctatagag cctacctgta 2280 

tgtcaataaa caacagctga agcacc 23 06 

<210> 83 

<211> 2656 

<212> DNA 

<213> Homo sapiens 

<400> 83 

gaattcgcgg ccgcagagtc cccgggccaa gatggctgcg cggtgctcca cacgctggtt 60 

gctggtggtt gtggggaccc cgcggctgcc ggctatatcg ggtagagggg cccggccgcc 120 

cagggagggc gtggtggggg catggctgag ccgcaagctg agcgtccccg cctttgcgtc 18 0 

ttccctgacc tcttgcggcc cccgagcgct gctgacattg agacctggtg tcagccttac 24 0 

aggaacaaaa cataaccctt tcatttgtac tgcctccttc cacacgagtg cccctttggc 300 

caaagaagat tattatcaga tattaggagt gcctcgaaat gccagccaga aagagatcaa 3 60 

gaaagcctat tatcagcttg ccaagaagta tcaccctgac acaaataagg atgatcccaa 420 

agccaaggag aagttctccc agctggcaga agcctatgag gttttgagtg atgaggtgaa 480 

gaggaagcag tacgatgcct acggctctgc aggcttcgat cctggggcca gcggctccca 540 

gcatagctac tggaagggag gccccactgt ggaccccgag gagctgttca ggaagatctt 600 

tggcgagttc tcatcctctt catttggaga tttccagacc gtgtttgatc agcctcagga 660 

atacttcatg gagttgacat tcaatcaagc tgcaaagggg gtcaacaagg agttcaccgt 72 0 

gaacatcatg gacacgtgtg agcgctgcaa cggcaagggg aacgagcccg gcaccaaggt 780 

gcagcattgc cactactgtg gcggctccgg catggaaacc atcaacacag gcccttttgt 84 0 
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gatgcgttcc acgtgtagga gatgtggtgg ccgcggctcc atcatcatat cgccctgtgt 900 

ggtctgcagg ggagcaggac aagccaagca gaaaaagcga gtgatgatcc ctgtgcctgc 96 0 

aggagtcgag gatggccaga ccgtgaggat gcctgtggga aaaagggaaa ttttcattac 102 0 

gttcagggtg cagaaaagcc ctgtgttccg gagggacggc gcagacatcc actccgacct 108 0 

ctttatttct atagctcagg ctcttcttgg gggaacagcc agagcccagg gcctgtacga 1140 

gacgatcaac gtgacgatcc cccctgggac tcagacagac cagaagattc ggatgggtgg 12 00 

gaaaggcatc ccccggatta acagctacgg ctacggagac cactacatcc acatcaagat 1260 

acgagttcca aagaggctaa cgagccggca gcagagcctg atcctgagct acgccgagga 1320 

cgagacagat gtggagggga cggtgaacgg cgtcaccctc accagctctg gtggcagcac 13 8 0 

catggatagc tccgcaggaa gcaaggctag gcgtgaggct ggggaggacg aggagggatt 144 0 

cctttccaaa cttaagaaaa tgtttacctc atgatatccc agccgaggaa aaagatccac 1500 

tggaaactag gccgggaagc agcagcccct ccaagggcca gggcacctgg gagacgggag 1560 

gattccagaa cagcagcact gagctcccac ccgcagagcc tctggacggc cttggcaaca 1620 

gcaaaatcat gggacaacac ctctctccac ggaaaggtca cagtggacag cccgggcagt 1680 

aggatgcagc cccagaggct ggtggcagtt tcctgtccat tggtaggtga cggccccctg 174 0 

gtcagcagag gagaggttag atcttgcagg ctaaaactct aatttggaat tgaatattgt 18 00 

ggatatctta gttaaaggcc atgcttacag cttagaaatg aagccttaag ctgcatcaag 1860 

ttacgaagtg attaatttcc ttctcagcaa acctccggga ggttccagaa tgagttcttc 192 0 

ctgacaggtt gtcttcactg ggagcgtggg gcccccaggc cccaccagca ccgtcctccc 1980 

ctaatgaggg gccctgccga ggcatcagct gctctgctca gttagttttt attcccgggg 2 040 

taccaagcag ctgcacagtc ggtgcctggg aagcacgtta aaggcccaga gagatcctgg 2100 

gggttctgct ctgaccgtgt gggtggtgat ccttgtcagg atgtacagtc cttgctccca 2160 

ccccatccgg gatggccgcc tgtccctgac tattgagtcc tgttgttgta agccaggcat 2220 

ggagggctcc tgcccttctg ctgagccaca gcccattgca gcactgtgct ggccagactt 22 80 

cagctgcctt gggaactgaa gccctgccac tgttgctagt caggggcttg gttctcccac 2340 

ttacactgtt gacatctatt ttctgaagtg tgtttaaatt attcagtgct aatcattgtt 2400 

ttttcctttg taaatgttga ttcagaaaag gaaagcacag gctaagcagt tgaaggttcc 2460. 

ccaccattca gtgagagcag aacccccatt ccccagcctc tgctggtagc atgtcgcagt 2520 

ttccatgtgt ttcaggatct tcgggctgtc gttagacagg ttaatgaaga acacttctca 2580 

acagtttcct ttttgttttc ctttataatt cactaaaata aagcatctat tagtgtctga 2640 

aaaaaaaaaa aaaaaa 2656 

<210> 84 
<211> 2217 
<212> DNA 

<213> Homo sapiens 
<400> 84 

gcggacccgg cgccgaggcg gccacccgag acgcggcgcg cacgctccgg cctgcgcagc 60 

ccggcccggc catggcggcc ccccgcccgt ctcccgcgat ctccgtttcg gtctcggctc 12 0 

cggcttttta cgccccgcag aagaagttcg gccctgtggt ggccccaaag cccaaagtga 180 

atcccttccg gcccggggac agcgagcctc ccccggcacc cggggcccag cgcgcacaga 240 

tgggccgggt gggcgagatt cccccgccgc ccccggaaga ctttcccctg cctccacctc 3 00 

cccttgctgg ggatggcgac gatgcagagg gtgctctggg aggtgccttc ccgccgcccc 3 60 

ctcccccgat cgaggaatca tttccccctg cgcctctgga ggaggagatc ttcccttccc 42 0 

cgccgcctcc tccggaggag gagggagggc ctgaggcccc cataccgccc ccaccacagc 4 80 

ccagggagaa ggtgagcagt attgatttgg agatcgactc tctgtcctca ctgctggatg 540 

acatgaccaa gaatgatcct ttcaaagccc gggtgtcatc tggatatgtg cccccaccag 600 

tggccactcc attcagttcc aagtccagta ccaagcctgc agccgggggc acagcacccc 660 

tgcctccttg gaagtcccct tccagctccc agcctctgcc ccaggttccg gctccggctc 720 

agagccagac acagttccat gttcagcccc agccccagcc caagcctcag gtccaactcc 7 80 

atgtccagtc ccagacccag cctgtgtctt tggctaacac ccagccccga gggcccccag 840 

cctcatctcc ggctccagcc cctaagtttt ctccagtgac tcctaagttt actcctgtgg 900 
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cttccaagtt cagtcctgga gccccaggtg gatctgggtc acaaccaaat caaaaattgg 960 

ggcaccccga agctctttct gctggcacag gctcccctca acctcccagc ttcacctatg 102 0 

cccagcagag ggagaagccc cgagtgcagg agaagcagca ccccgtgccc ccaccggctc 1080 

agaaccaaaa ccaggtgcgc tcccctgggg ccccagggcc cctgactctg aaggaggtgg 114 0 

aggagctgga gcagctgacc cagcagctaa tgcaggacat ggagcatcct cagaggcaga 1200 

atgtggctgt caacgaactc tgcggccgat gccatcaacc cctggcccgg gcgcagccag 12 60 

ccgtccgcgc tctagggcag ctgttccaca tcgcctgctt cacctgccac cagtgtgcgc 132 0 

agcagctcca gggccagcag ttctacagtc tggagggggc gccgtactgc gagggctgtt 13 80 

acactgacac cctggagaag tgtaacacct gcggggagcc "catcactgac cgcatgctga 144 0 

gggccacggg caaggcctat cacccgcact gcttcacctg tgtggtctgc gcccgccccc 1500 

tggagggcac ctccttcatc gtggaccagg ccaaccggcc ccactgtgtc cccgactacc 1560 

acaagcagta cgccccgagg tgctccgtct gctctgagcc catcatgcct gagcctggcc 162 0 

gagatgagac tgtgcgagtg gtcgccctgg acaagaactt ccacatgaag tgttacaagt 168 0 

gtgaggactg cgggaagccc ctgtcgattg aggcagatga caatggctgc ttccccctgg 1740 

acggtcacgt gctctgtcgg aagtgccaca ctgctagagc ccagacctga gtgaggacag 18 00 

gccctcttca gaccgcagtc catgccccat tgtggaccac ccacactgag accacctgcc 1860 

cccacctcag ttattgtttt gatgtctagc ccctcccatt tccaacccct ccctagcatc 1920 

ccaggtgccc tgacccagga cccaacatgg tctagggatg caggatcccc gccctggggt 1980 

ctggtcctcg cccatcctgc agggattgcc caccgtcttc cagacacccc acctgagggg 2040 

ggcaccaggt ttagtgctgc tgctttcact gctgcacccg cgccctcggc cggccccccg 2100 

agcagccttt gtactctgct tgcggagggc tgggagaccc tccaggacat tcccaccctc 2160 

ccccatgctg ccaagttgta gctatagcta caaataaaaa aaaaccttgt tttccag 2217 

<210> 85 
<211> 8906 
<212> DNA 

<213> Homo sapiens 
<400> 85 

gaggcggcca aggacctggc cgacatcgcg gccttcttcc gatccgggtt tcgaaaaaac 60 

gatgaaatga aagctatgga tgttttacca attttgaagg aaaaagttgc atacctttca 12 0 

ggtgggagag ataaacgtgg aggtcccatt ttaacgtttc cggcccgcag caatcatgac 180 

agaatacgac aggaggatct caggagactc atttcctatc tagcctgtat tcccagcgag 24 0 

gaggtctgca agcgtggctt cacggtgatc gtggacatgc gtgggtccaa gtgggactcc 3 00 

atcaagcccc ttctgaagat cctgcaggag tccttcccct gctgcatcca tgtggccctg 3 60 

atcatcaagc cagacaactt ctggcagaaa cagaggacta attttggcag ttctaaattt 42 0 

gaatttgaga caaatatggt ctctttagaa ggccttacca aagtagttga tccttctcag 480 

ctaactcctg agtttgatgg ctgcctggaa tacaaccacg aagaatggat tgaaatcaga 54 0 

gttgcttttg aagactacat tagcaatgcc acccacatgc tgtctcggct ggaggaactt 60 0 

caggacatcc tagctaagaa ggagctgcct caggatttag agggggctcg gaatatgatc 660 

gaggaacatt ctcagctgaa gaagaaggtg attaaggccc ccatcgagga cctggatttg 72 0 

gagggacaga agctgcttca gaggatacag agcagtgaaa gctttcccaa aaagaactca 78 0 

ggctcaggca atgcggacct gcagaacctc ttgcccaagg tgtccaccat gctggaccgg 84 0 

ctgcactcga cacggcagca tctgcaccag atgtggcatg tgaggaagct gaagctggac 900 

cagtgcttcc agctgaggct gtttgaacag gatgctgaga agatgtttga ctggatcaca 960 

cacaacaaag gcctgtttct aaacagctac acagagattg ggaccagcca ccctcatgcc 102 0 

atggagcttc agacgcagca caatcacttt gccatgaact gtatgaacgt gtatgtaaat 10 80 

ataaaccgca tcatgtcggt ggccaatcgt ctggtggagt ctggccacta tgcctcgcag 114 0 

cagatcaggc agatcgcgag tcagctggag caggagtgga aggcgtttgc ggcagccctg 120 0 

gatgagcgga gcaccttgct ggacatgtcc tccattttcc accagaaggc cgaaaagtat 12 6 0 

atgagcaacg tggattcatg gtgtaaagct tgcggtgagg tagaccttcc ctcagagctg 1320 

caggacctag aagatgccat tcatcaccac cagggaatat atgaacatat cactcttgct 13 80 

tattctgagg tcagccaaga tgggaagtcg ctccttgaca agctccagcg gcccttgact 144 0 
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cccggcagct 
ctggatgtca 
cgcaaggtcc 
gtgctagact 
aaatctcttc 
cagaacacat 
ggggaatgtg 
gatttcgttc 
acccatgtga 
gtgtatgccg 
cagaccaccc 
ctcagggact 
gagacggtgc 
cgcaagatca 
attatctcag 
acagaagatc 
atgaacaact 
gaggtccagg 
gtccaggacc 
cagcatcgga 
caggtgctgg 
agctcgttac 
gagaaaacac 
aaccactacg 
cagctcatgc 
aaaacctcag 
gaagactggt 
cccatgatca 
cggaggaatg 
atggtgacgc 
caacgggaga 
cagcagtacg 
ggcgagttct 
ctcctgaaag 
ctattgatac 
aaaaaatgtg 
tacaggacct 
agtctccagc 
gctgctcatg 
gctgagctca 
tacctgtggg 
ctcatcatct 
gagctggaaa 
gacaagtttc 
ttggaacatg 
atttcttcct 
gagctgctga 
ctcagcgtgc 
gaaaacattg 
aaaaccttaa 
tttagtaaag 
tttacctcag 
tgggtgggga 



ccgattccct 
tccacgaggt 
ggctgcatca 
ggatcgagaa 
atcgggccag 
acaccaatgc 
accccgaaga 
ggcgtgttga 
aagagctgtg 
agtcggtgga 
tgcaggtgac 
ctgccatctc 
tgcagcagct 
agctggagct 
acctcgagtc 
tcacgattgc 
tgacttttga 
cctctggtgt 
tgctggagtt 
aacacctgga 
gttggatccg 
aagaggcaga 
atcagagcgc 
acatggacat 
tcaagatgga 
agcaggtctg 
gtggcggggc 
gcaagcacct 
cagacgtctt 
acatcaaagc 
acagggtatt 
tggtctttga 
acctttccac 
agcacgagga 
agctggctga 
ttactgctgt 
ctttggaaaa 
tagatatcat 
aacttaatga 
ttcaaactga 
aaatgaccag 
tcggaaacat 
aatatgaaca 
agatgtatgt 
cagggtccta 
accttattaa 
cgtgctgtga 
cgaagcgagc 
agtctcaggg 
ttcgaaaggg 
aagtgaaaga 
agttgggtgt 
gaacaccaac 



gacagcctct 
gctgcaccac 
gaggctgcag 
ccacggagaa 
agcattgcag 
ggataaatta 
gatttatcag 
gcagcgaaag 
gacgtggctg 
ggccgtgcag 
tgtcaacgtg 
cagtaacaag 
ggacgaggcg 
cttcctgcac 
ttggaatgat 
agagcagcgc 
cgtcatccac 
ggagctgctg 
tcttcatgaa 
gcagtgcgtg 
caacggagag 
gcagctccag 
gctgcaggtg 
gatccgggac 
agatcgcctc 
cagcgtcctc 
ggataagctg 
ggagcagaag 
cctgaaatac 
tcctgaacag. 
gcattactgg 
gaggagtgcc 
acacacctcc 
gttccagata 
tggcttttgt 
ggataagagg 
agccctgggg 
tccagccagt 
agagaagcgg 
aaaggcttat 
tggcgtggaa 
gcaagaaatc 
gttgccagag 
cacatattgc 
ttttgacgag 
accagtfccag 
ggaaggaaag 
caatgacgcc 
agaactcatc 
tcgagaacgg 
ttccagtggg 
cacagaacat 
ttcagataat 



gccaactact 
cagcggcacg 
ctgtgtgttt 
gcatttctga 
aaacgtcatg 
ctggaagcag 
gctgcccatc 
atcctactgg 
gaggagctgc 
gacctcatca 
atcaaggaag 
accccccaca 
cagtcgcaga 
gtgcgcatct 
gagctttctc 
ctccagcacc 
caagggcaag 
tgtgatagag 
aaacagcagg 
cagctgcgcc 
tccatgttaa 
cgagagcacg 
cagcagaagg 
tgcgccgaga 
aagctcgtca 
gagagcctgg 
ggcccaaact 
gaggcattcc 
ctgcacagga 
caagtgaaaa 
accatgagga 
aagcaggctt 
acgggctcca 
actgcaaagc 
gaaaaagggc 
tacagagatt 
atttcttcag 
atccctggct 
aaatctgccc 
gtaagagacc 
gagattccac 
tacgaatttc 
gatgttggac 
aaaaataagc 
atacagcagc 
cgaataacga 
ggagagatta 
atgcacctca 
ctacaggaat 
catctcttcc 
agaagcaagt 
gttgaaggag 
aaaattgtcc 



ccaaggccgt 
tgagaacaat 
tccagcagga 
gcaaacatac 
aagattttga 
cagaacagct 
agctggaaga 
acatgtcagt 
agaaggagct 
agcgctttgg 

gggaggacct 

acagctccat 
tggaggagct 
tcgagaggga 
agcaaatgaa 
atgcagacaa 
atcttctgca 
atgtagacat 
aattggattt 
acctgcaggc 
atgccggact 
agcagttcca 
cagaagccat 
aggtggcgtc 
acgcctctgt 
aacaggagta 
ctgagacgga 
tgaaggcttg 
acagcgtgaa 
atatcttgaa 
agagacggct 
tggaatggat 
gtatacagca 
aaaccaaaga 
atgcccatgc 
tctctctgcg 
attccaacaa 
cagaggtgaa 
gcaggaaaga 
tccgggaatg 
ctggcattgt 
ataataacat 
attgttttgt 
ctgattctac 
gacatggatt 
aatatcagct 
aagatggcct 
gcatgctgga 
ccttccaagt 
tttttgaaat 
acctttataa 
acccttgcaa 
ttaaggcttc 



gcaccatgtc 


1500 


ctggcaacac 


1560 


agttcagcag 


1620 


aggtgtgggg 


1680 


agaagtggca 


1740 


ggctcagact 


1800 


ccggattcaa 


1860 


gtcctttcac 


1920 


gctggacgac 


1980 


c cage age ag 


2040 


catccagcag 


2100 


caaccacatt 


2160 


cttccaggag 


2220 


cgccatcgac 


2280 


tgacttcgac 


2340 


agccttgacc 


2400 


gtatgtcaat 


2460 


ggcaactegg- 


2520 


ageegcagag 


2580 


agaagtgaaa 


2640 


tatcacagcc 


2700 


geatgecatt 


2760 


gctacaggcc 


2820 


tcactggcaa 


2880 


cgctttctac 


2940 


caagagagaa 


3000 


ccacgtgacg 


3060 


cacccttgct 


3120 


catgecagga 


3180 


tgaactcttc 


3240 


ggaccagtgt 


3300 


ccatgacaat 


3360; 


cacccaggag 


3420 


gagagtgaag 


3480 


ggcagagata 


3540 


gatggagaag 


3600 


atcgagtaaa 


3660 


acttcgagat 


3720 


gttcataatg 


3780 


tatggatacg 


3840 


aaacaaagaa 


3900 


attcctaaag 


3960 


tacttgggca 


4020 


tcagctgata 


4080 


agccaattcc 


4140 


ccttttaaaa 


4200 


ggaggtgatg 


4260 


agggtttgat 


4320 


gtgggaccca 


4380 


gtccttagta 


4440 


aagcaaattg 


4500 


atttgeactg 


4560 


cagcatagag 


4620 
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aacaagcagg actggataaa gcatatccgc gaagtcatcc aggagcggac gatccacctg 4680 

aagggagccc tgaaggagcc cattcacatc cctaagaccg ctcccgccac aagacagaag 4740 

ggaaggaggg atggagagga tctggacagc caaggagacg gcagcagcca gcctgatacg 4800 

atttccatcg cctcacggac gtctcagaac acgctggaca gcgataagct ctctggtggc 4860 

tgtgagctga cagtggtgat ccatgacttc accgcttgca acagcaacga gctgaccatc 4920 

cgacggggcc agaccgtgga agttctggag cggccgcatg acaagcctga ctggtgtctg 4980 

gtgcggacca ctgaccgctc cccagcggca gaaggcctgg tcccctgtgg ttcactgtgc 5040 

atcgcccact ccagaagtag catggaaatg gagggcatct tcaaccacaa agactcgctc 5100 

tccgtctcca gcaatgacgc cagtccaccc gcatccgtgg cttccctcca gccccacatg 5160 

atcggggccc agagctcgcc gggccccaag cggccgggca acaccctgcg caagtggctc 5220 

accagccccg tgcggcggct cagcagcggc aaggccgacg ggcacgtgaa gaagctggcg 5280 

cacaagcaca agaagagccg cgaggtccgc aagagcgccg acgccggctc gcagaaggac 5340 

tccgacgaca gtgcggccac cccgcaggac gagacggtcg aggagagagg ccggaacgag 5400 

ggcctgagca gcggtactct ctccaaatcc tcctcctcgg ggatgcagag ctgtggagaa 5460 

gaggaaggcg aggagggggc cgacgccgtg cccctgccgc cacccatggc catccagcag 5520 

cacagcctcc tccagccaga ctcacaggat gacaaggcct cttctcggtt attagtccgc 5580 

cccaccagct ccgaaacacc gagtgcagcc gagctcgtca gtgcaattga ggaactcgtg 564 0 

aaaagcaaga tggcactgga ggatcgcccc agctcactcc ttgttgacca gggagatagt 5700 

agcagccctt ccttcaaccc ttcggataat tcccttctct cttcctcctc gcccattgat 5760 

gagatggaag aaaggaaatc cagctcttta aagagaagac actacgtttt gcaagaacta 582 0 

gtggagacag agcgtgacta tgtgcgggac cttggctatg tggttgaggg ctacatggca 5880 

cttatgaaag aagatggtgt tcctgatgac atgaaaggaa aagacaaaat tgtgttcggc 5940 

aacatccatc agatttacga ctggcacaga gacttttttt taggagagtt agagaagtgc 6000 

cttgaagatc cagaaaaact aggatccctt tttgttaaac acgagagaag gttg.cacatg 6060 

tacatagctt attgtcaaaa taaaccaaag tctgagcaca ttgtctcaga atacattgat 612 0 

accttttttg aggacttaaa gcagcgtctt ggccacaggt tacagctcac agatctgttg 6180 

atcaaaccag tgcagagaat catgaagtat cagctgttac tgaaggactt cctcaagtat 6240 

tccaaaaagg ccagcctgga tacatcagaa ttagagagag ctgtggaagt catgtgcata 6300 

gtacccaggc ggtgcaacga catgatgaac gtggggcggc tgcaaggatt cgacgggaaa 6360 

atcgttgccc agggtaaact gctcttgcag gacacattct tggtcacaga ccaagatgca 6420 

ggacttctgc ctcgctgcag agagaggcgc atcttcctct ttgagcagat cgtcatattc 6480 

agcgaaccac ttgataaaaa gaagggcttc tccatgccgg gattcctgtt taagaacagt 654 0 

atcaaggtga gttgcctttg cctggaggaa aatgtggaaa atgatccctg taaatttgct 6600 

ctgacatcga ggacgggtga cgtggtagag accttcattt tgcattcatc tagtccaagt 6660 

gtccggcaaa cttggatcca tgaaatcaac caaattttag aaaaccagcg caatttttta 6720 

aatgccttga catcgccaat cgagtaccag aggaaccaca gcgggggcgg cggcggcggc 6780 

ggcagcgggg cagcggcggg ggtgggggca gcggcggcgg cggggccccc agtggcggca 684 0 

gcggccacag tggcggcccc agcagctgcg gcggcgcccc cagcacgagc aggagccggc 6900 

cctcccggat cccccagcct gtccgacacc acccccccgt gctggtctcc tctgcagcct 6960 

cgagccaggc agaggcagac aagatgtcag agtgaaagca gcagcagtag caacatctcc 702 0 

accatgttgg tgacacacga ttacacggca gtgaaggagg atgagatcaa cgtctaccaa 7080 

ggagaggtcg ttcaaattct ggccagcaac cagcagaaca tgtttctggt gttccgagcc 7140 

gccactgacc agtgccccgc agctgagggc tggattccag gctttgtcct gggccacacc 7200 

agtgcagtca tcgtggagaa cccggacggg actctcaaga agtcaacatc ttggcacaca 7260 

gcactccgtt taaggaaaaa atctgagaaa aaagataaag acggcaaaag ggaaggcaag 7320 

ttagagaacg gttatcggaa gtcacgggaa ggactcagca acaaggtatc -tgtgaagctt 73 80 

ctcaatccca actacattta tgacgttccc ccagaattcg tcattccatt gagtgaggtc 744 0 

acgtgtgaga caggggagac cgttgttctt agatgtcgag tctgtggccg ccccaaagcc 7500 

tcaattacct ggaagggccc tgaacacaac accttgaaca acgatggtca ctacagcatc 7560 

tcctacagtg acctgggaga ggccacgctg aagattgtgg gcgtgaccac ggaagatgac 7620 

ggcatctaca cgtgcatcgc tgtcaatgac atgggttcag cctcatcatc ggccagcctg 7680 

agggtcctag gtccagggat ggatgggatc atggtgacct ggaaagacaa ctttgactcc 7740 

ttctacagtg aagtggctga gcttggcagg ggcagattct ctgtcgttaa gaaatgtgat 7800 
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cagaaaggaa ccaagcgagc agtggccact aagtttgtga acaagaagtt gatgaagcgc 7860 

gaccaggtca cccatgagct tggcatcctg cagagcctcc agcaccccct gcttgtcggc 792 0 

ctcctcgaca cctttgagac ccccaccagc tacatcctgg tcttagaaat ggctgaccag 7980 

ggtcgcctcc tggactgcgt ggtgcgatgg ggaagcctca ctgaagggaa gatcagggcg 8 04 0 

cacctggggg aggttctgga agctgtccgg tacctgcaca actgcaggat agcacacctg 810 0 

gacctaaagc ctgagaatat cctggtggat gagagtttag ccaagccaac catcaaactg 8160 

gctgactttg gagatgctgt tcagctcaac acgacctact acatccacca gttactgggg 822 0 

aaccctgaat tcgcagcccc tgaaatcatc ctcgggaacc ctgtctccct gacctcggat 8280 

acgtggagtg ttggagtgct cacatacgta cttcttagtg gcgtgtcccc cttcctggat 8340 

gacagtgtgg aagagacctg cctgaacatfc tgccgcttag actttagctt cccagatgac 8400 

tactttaaag gagtgagcca gaaggccaag gagttcgtgt gcttcctcct gcaggaggac 8460 

cccgccaagc gtccctcggc tgcgctggcc ctccaggagc agtggctgca ggccggcaac 852 0 

ggcagaagca cgggcgtcct cgacacgtcc agactgactt ccttcattga gcggcgcaaa 8580 

caccagaatg atgttcgacc tatccgtagc attaaaaact ttctgcagag caggcttctg 8640 

cctagagttt gacctatcca gaagttcttt ctcattctct ttcacctgcc aatcagctgt 8700 

taatctgaat tttcaagaga aaacaagcaa acataactga tcagctgccg gtatgttcat 8760 

cgtgtgaaat tgcattccaa gtgagctgtg ctcagcagtg cttggacaca gagctgcaag 8 820 

ctgcgctggg gtggaggacc gtcacttaca ctctgccaag gacggaggtc gcattgctgt 8880 

atcacagtat tttttacgga tttctg 8 906 

<210> 86 

<211> 1204 

<212> DNA 

<213> Homo sapiens 

<400> 86 

tcggcggcgg tggtatcggc ggcagctgtg agggggttcc gggaagatgg tgctgatcaa 60 

ggaattccgt gtggttttgc catgttctgt tcaggagtat caggttgggc agctttactc 12 0 
tgttgcagaa gctagtaaga atgagactgg tggtggagaa ggaattgaag tcttaaagaa . 180 

tgaaccttat gagaaggatg gagaaaaggg acagtatacg cacaaaattt atcacctaaa 24 0 

gagcaaagtg cctgcattcg tgaggatgat tgctcccgag ggctccttgg tgtttcatga 300 

gaaagcctgg aatgcgtacc cctactgtag aacaattgta acgaatgaat atatgaaaga 360 

tgatttcttc attaaaatcg aaacatggca caaaccagac ttgggaacat tagaaaatgt 420 

acatggttta gatccaaaca catggaaaac tgttgaaatt gtccatatag atattgcaga 480 

tagaagtcaa gttgaaccag cagactacaa agctgatgaa gacccagcat tattccagtc 540 

agtcaagacc aagagaggcc ctttgggacc caactggaag aaggagctgg caaacagccc 600 

tgactgtccc cagatgtgtg cctataagct ggtgaccatc aaattcaagt ggtggggact 660 

gcaaagcaaa gtagaaaact tcattcaaaa gcaagaaaaa cggatattta caaacttcca 720 

tcgccagctt ttttgttgga ttgacaagtg gatcgatctc acgatggaag acattaggag 780 

aatggaagac gagactcaga aagaactaga aacaatgcgt aagaggggtt ccgttcgagg 840 

cacgtcggct gctgatgtct agatgagtcc cctgtagggt cagagacaat gtcaaactgt 900 

ttacgtaatc aaggtcaagt gaggggaaca agcgcagcca gtgatgagtg aacaacaatc 960 

tgaccagtat cttgcagtgt tgacgtttcc cagatgtgtg cttgtgatga tacacacaca 102 0 

tgcacaggtt ctcaaccacg tgtgtatata tgtatgtgtg catatgtctg tagctgtata 1080 

. taaagcgcat gtagagctac agatecagat acacacactt gtgtatatat gtacatacag 1140 

acatactgaa gggattagta caatttctcc aaagtactgt acctatcttc agcaagaatg 1200 

caaa 12 04 

<210> 87 

<211> 892 

<212> PRT 

<213> Homo sapiens 
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<400> 87 

Met Asp His Tyr Asp Ser Gin Gin Thr Asn Asp Tyr Met Gin Pro Glu 

15 10 15 

Glu Asp Trp Asp Arg Asp Leu Leu Leu Asp Pro Ala Trp Glu Lys Gin 

20 25 30 

Gin Arg Lys Thr Phe Thr Ala Trp Cys Asn Ser His Leu Arg Lys Ala 

35 40 45 

Gly Thr Gin He Glu Asn He Glu Glu Asp Phe Arg Asp Gly Leu Lys 

5 0 55 60 

Leu Met Leu Leu Leu Glu Val He Ser Gly Glu Arg Leu Ala Lys Pro 
65 70 75 80 

Glu Arg Gly Lys Met Arg Val His Lys He Ser Asn Val Asn Lys Ala 

85 90 95 

Leu Asp Phe He Ala Ser Lys Gly Val Lys Leu Val Ser He Gly Ala 

100 105 110 

Glu Glu He Val Asp Gly Asn Val Lys Met Thr Leu Gly Met He Trp 

115 120 125 

Thr He He Leu Arg Phe Ala He Gin Asp He Ser Val Glu Glu Thr 

130 135 140 

Ser Ala Lys Glu Gly Leu Leu Leu Trp Cys Gin Arg Lys Thr Ala Pro 
14 5 150 155. 160 

Tyr Lys Asn Val Asn He Gin Asn Phe His He Ser Trp Lys Asp Gly 

165 170 ^ 175 

Leu Gly Phe Cys Ala Leu He His Arg His Arg Pro Glu Leu He Asp 

180 185 190 

Tyr Gly Lys Leu Arg Lys Asp Asp Pro Leu Thr Asn Leu Asn Thr Ala 

195 200 205 

Phe Asp Val Ala Glu Lys Tyr Leu Asp He Pro Lys Met Leu Asp Ala 

210 215 220 

Glu Asp He Val Gly Thr Ala Arg Pro Asp Glu Lys Ala He Met Thr 
225 230 235 . 240 

Tyr Val Ser Ser Phe Tyr His Ala Phe Ser Gly Ala Gin Lys Ala Glu 

245 250 255 

Thr Ala Ala Asn Arg He Cys Lys Val Leu Ala Val Asn Gin Glu Asn 

260 265 270 

Glu Gin Leu Met Glu Asp Tyr Glu Lys Leu Ala Ser Asp Leu Leu Glu 

275 . 280 285 

Trp He Arg Arg Thr He Pro Trp Leu Glu Asn Arg Val Pro Glu Asn 

290 295 300 

Thr Met His Ala Met Gin Gin Lys Leu Glu Asp Phe Arg Asp Tyr Arg 
3 °5 310 315 ~ 320 

Arg Leu His Lys Pro Pro Lys Val Gin Glu Lys Cys Gin Leu Glu He 

. 325 330 335 

Asn Phe Asn Thr Leu Gin Thr Lys Leu Arg Leu Ser Asn Arg Pro Ala 

340 345 350 

Phe Met Pro Ser Glu Gly Arg Met Val Ser Asp He Asn Asn Ala Trp 

355 360 365 

Gly Cys Leu Glu Gin Val Glu Lys Gly Tyr Glu Glu Trp Leu Leu Asn 

370 375 380 

Glu He Arg Arg Leu Glu Arg Leu Asp His Leu Ala Glu Lys Phe Arg 
3Q 5 390 395 400 

Gin Lys Ala Ser He His Glu Ala Trp Thr Asp Gly Lys Glu Ala Met 
405 410 415- 
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Leu 


Arg 


Gin 


Lys 
420 


Asp 


Tyr 


Glu 


Thr 


Ala 

425 


Thr 


Leu 


Ser 


Glu 


He 
430 


Lys 


Ala 


Leu 


Leu 


Lys 


Lys 


His 


Glu 


Ala 


Phe 


Glu 


Ser Asp 


Leu 


Ala 


Ala 


His 


Gin 






435 










440 










445 








Asp 


Arg 
450 


Val 


Glu 


Gin 


He 


Ala 

455 


Ala 


He 


Ala 


Gin 


Glu 
460 


Leu 


Asn 


Glu 


Leu 


Asp 


Tyr 


Tyr 


Asp 


Ser 


Pro 


Ser 


Val 


Asn 


Ala 


Arg 


Cys 


Gin 


Lys 


He 


Cys 


465 










470 










475 










480 


Asp 


Gin 


Trp 


Asp 


Asn 


Leu 


Gly Ala 


Leu 


Thr 


Gin 


Lys 


Arg 


Arg 


Glu 


Ala 










485 










490 










495 




Leu 


Glu 


Arg 


Thr 
500 


Glu 


Lys 


Leu 


Leu 


Glu 
505 


Thr 


He 


Asp 


Gin 


Leu 
510 


Tyr 


Leu 


Glu 


Tyr 


Ala 
515 


Lys 


Arg 


Ala 


Ala 


Pro 
520 


Phe 


Asn 


Asn 


Trp 


Met 
525 


Glu 


Gly 


Ala 


Met 


Glu 
530 


Asp 


Leu 


Gin 


Asp 


Thr 
535 


Phe 


He 


Val 


His 


Thr 
540 


He 


Glu 


Glu 


He 


Gin 


Gly 


Leu 


Thr 


Thr 


Ala 


His 


Glu 


Gin 


Phe 


Lys 


Ala 


Thr 


, Leu 


Pro 


Asp 


545 










550 










555 










560 


Ala 


Asp 


Lys 


Glu 


Arg 


Leu 


Ala 


lie 


Leu 


Gly He 


His 


Asn 


Glu 


Val 


Ser 










565 










570 










575 




Lys 


He 


Val 


Gin 


Thr 


Tyr 


His 


Val 


Asn 


Met 


Ala 


Gly Thr 


Asn 


Pro 


Tyr 








580 










585 










590 






Thr 


Thr 


He 


Thr 


Pro 


Gin 


Glu 


He Asn Gly Lys 


Trp 


Asp 


His 


Val 


Arg 






595 










600 










605 








Gin 


Leu 
610 


Val 


Pro 


Arg 


Arg 


Asp 
615 


Gin 


Ala 


Leu 


Thr 


Glu 
620 


Glu 


His 


Ala 


Arg 


Gin 


Gin 


His 


Asn 


Glu 


Arg 


Leu Arg 


Lys 


Gin 


Phe 


Gly Ala 


Gin 


Ala 


Asn 


625 










630 










635 










640 


Val 


He 


Gly 


Pro 


Trp 


He 


Gin Thr Lys 


Met 


Glu 


Glu 


He 


Gly 


Arg 


He 










645 










650 










655 




Ser 


He 


Glu 


Met 


His 


Gly 


Thr 


Leu Glu Asp 


Gin 


Leu 


Ser 


His 


Leu 


Arg 








660 










665 










670 






Gin 


Tyr 


Glu 
675 


Lys 


Ser 


He 


Val 


Asn 
680 


Tyr 


Lys 


Pro 


Lys 


He 
685 


Asp 


Gin 


Leu 


Glu 


Gly 
690 


Asp 


His 


Gin 


Leu 


He 
695 


Gin 


Glu 


Ala 


Leu 


He 
700 


Phe 


Asp 


Asn 


Lys 


His 


Thr 


Asn 


Tyr 


Thr 


Met 


Glu 


His 


He Arg Val 


Gly Trp 


Glu 


Gin 


Leu 


705 










710 










715 










720 


Leu 


Thr 


Thr 


He 


Ala 
725 


Arg 


Thr 


He 


Asn 


Glu 
730 


Val 


Glu 


Asn 


Gin 


He 
735 


Leu 


Thr 


Arg 


Asp 


Ala 
740 


Lys 


Gly 


He 


Ser 


Gin 
745 


Glu 


Gin 


Met 


Asn 


Glu 
750 


Phe 


Arg 


Ala 


Ser 


Phe 


Asn 


His 


Phe 


Asp Arg Asp His 


Ser 


Gly Thr 


Leu 


Gly 


Pro 






755 










760 










765 








Glu 


Glu 


Phe 


Lys 


Ala 


Cys 


Leu 


He 


Ser Leu Gly 


Tyr Asp 


He 


Gly 


Asn 




770 










775 










780 










Asp 


Pro 


Gin 


Gly 


Glu 


Ala 


Glu 


Phe 


Ala- Arg 


He 


Met 


Ser 


He 


Val 


Asp 


785 










790 










795 










800 


Pro 


Asn 


Arg 


Leu 


Gly Val 


Val 


Thr 


Phe 


Gin 


Ala 


Phe 


He 


Asp 


Phe 


Met 










805 










810 










815 




Ser 


Arg 


Glu 


Thr 


Ala 


Asp 


Thr 


Asp 


Thr Ala Asp 


Gin 


Val 


Met 


Ala 


Ser 








820 










825 










830 






Phe 


Lys 


He 


Leu 


Ala 


Gly 


Asp 


Lys 


Asn 


Tyr 


He 


Thr 


Met 


Asp 


Glu 


Leu 
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835 840 
Arg Arg Glu Leu Pro Pro Asp Gin 

850 855 
Ala Pro Tyr Thr Gly Pro Asp Ser 
865 870 
Ser Phe Ser Thr Ala Leu Tyr Gly 
885 



845 

Ala Glu Tyr Cys lie Ala Arg Met 
860 

Val Pro Gly Ala Leu Asp Tyr Met 
875 880 
Glu Ser Asp Leu 
890 



<210> 88 
<211> 197 
<212> PRT 

<213> Homo sapiens 



<400> 88 






















Met Met Phe 


Pro 


Gin 


Ser Arg His 


Ser 


Gly Ser 


Ser 


His 


Leu 


Pro 


Gin 


1 




5 






10 








15 




Gin Leu Lys 


Phe 


Thr 


Thr Ser Asp 


Ser 


Cys Asp 


Arg 


He 


Lys 


Asp 


Glu 




20 






25 








30 






Phe Gin Leu 


Leu 


Gin 


Ala Gin Tyr 


His 


Ser Leu 


Lys 


Leu 


Glu 


Cys 


Asp 


35 






40 








45 








Lys Leu Ala 


Ser 


Glu 


Lys Ser Glu Met Gin Arg His 


Tyr Val 


Met 


Tyr 


50 






55 






60 










Tyr Glu Met 


Ser 


Tyr 


Gly Leu Asn 


lie 


Glu Met 


His 


Lys 


Gin 


Ala 


Glu 


65 






70 




75 










80 


lie Val Lys 


Arg 


Leu Asn Gly lie 


Cys 


Ala Gin 


Val 


Leu 


Pro 


Tyr 


Leu 






85 






90 








95 




Ser Gin Glu 


His 


Gin 


Gin Gin Val 


Leu 


Gly Ala 


He 


Glu 


Arg 


Ala 


Lys 




100 






105 








110 






Gin Val Thr 


Ala 


Pro 


Glu Leu Asn 


Ser 


He. lie 


Arg 


Gin 


Gin 


Leu 


Gin 


115 






120 








125 








Ala His Gin 


Leu 


Ser 


Gin Leu Gin 


Ala 


Leu Ala 


Leu 


Pro 


Leu 


Thr 


Pro 


130 






135 






140 










Leu Pro Val 


Gly 


Leu 


Gin Pro Pro 


Ser 


Leu Pro 


Ala 


Val 


Ser 


Ala 


Gly 


145 






150 




155 










160 


Thr Gly Leu 


Leu 


Ser 


Leu Ser Ala 


Leu 


Gly Ser 


Gin 


Ala 


His 


Leu 


Ser 






165 






170 








175 




Lys Glu Asp 


Lys 


Asn Gly His Asp Gly Asp Thr His 


Gin Glu Asp Asp 




180 






185 








19 0- 






Gly Glu Lys 


Ser 


Asp 


















195 























<210> 89 . 
<211> 739 
<212> PRT 

<213> Homo sapiens 
<400> 89 

Gly Asp Lys Glu Pro Thr Glu Thr He Gly Asp Leu Ser He Cys Leu 

15 10 15 

Asp Gly Leu Gin Leu Glu Ser Glu Val Val Thr Asn Gly Glu Thr Thr 

20 25 30 

Cys Ser Glu Ser Ala Ser Gin Asn Asp Asp Gly Ser Arg Ser Lys Asp 
35 4 0 45 
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Glu 


Thr 


Arg 


Val 


Ser 


Thr 


Asn 


Gly 


Ser 


Asp 


Asp 


Pro 


Glu Asp 


Ala 


Gly 




50 










55 










60 










Ala 


Gly 


Glu 


Asn 


Arg 


Arg 


Val 


Ser 


Gly Asn Asn 


Ser 


Pro 


Ser 


Leu 


Ser 


65 










70 










75 










80 


Asn 


Gly 


Gly 


Phe 


Lys 


Pro 


Ser 


Arg 


Pro 


Pro Arg 


Pro 


Ser Arg 


Pro 


Pro 










85 










90 










95 




Pro 


Pro 


Thr 


Pro 


Arg 


Arg 


Pro 


Ala 


Ser 


Val 


Asn 


Gly 


Ser 


Pro 


Ser 


Ala 








100 










105 










110 






Thr 


Ser 


Glu 


Ser 


Asp 


Gly 


Ser 


Ser 


Thr 


Gly 


Ser 


Leu 


Pro 


Pro 


Thr 


Asn 






115 










120 










125 








Thr 


Asn 


Thr 


Asn 


Thr 


Ser 


Glu 


Gly Ala 


Thr 


Ser 


Gly 


Leu 


He 


He 


Pro 




130 










135 










14 0 










Leu 


Thr 


He 


Ser 


Gly Gly 


Ser 


Gly Pro Arg 


Pro 


Leu 


Asn 


Pro 


Val 


Thr 


145 










150 










155 










160 


Gin 


Ala 


Pro 


Leu 


Pro 


Pro 


Gly 


Trp 


Glu Gin Arg 


Val 


Asp 


Gin 


His 


Gly 










165 










170 










175 




Arg 


Val 


Tyr 


Tyr 


Val 


Asp 


His 


Val 


Glu 


Lys 


Arg 


Thr 


Thr 


Trp 


Asp 


Arg 








180 










185 










190 






Pro 


Glu 


Pro 


Leu 


Pro 


Pro 


Gly 


Trp 


Glu 


Arg Arg 


Val 


Asp 


Asn 


Met 


Gly 






195 










200 










205 








Arg 


lie 


Tyr 


Tyr 


Val 


As P 


His 


Phe 


Thr 


Arg 


Thr 


Thr 


Thr 


Trp 


Gin 


Arg 




210 










215 










220 










Pro 


Thr 


Leu 


Glu 


Ser 


Val 


Arg 


Asn 


Tyr 


Glu 


Gin 


Trp 


Gin 


Leu 


Gin 


Arg 


225 










230 










235 










240 


Ser 


Gin 


Leu 


Gin 


Gly Ala 


Met 


Gin 


Gin 


Phe 


Asn 


Gin 


Arg 


Phe 


He 


Tyr 










245 










250 










255 




Gly 


Asn 


Gin 


Asp 


Leu 


Phe 


Ala 


Thr 


Ser 


Gin 


Ser 


Lys 


Glu 


Phe 


Asp 


Pro 








260 










265 










270 






Leu 


Gly 


Pro 


Leu 


Pro 


Pro 


Gly 


Trp 


Glu 


Lys 


Arg 


Thr Asp 


Ser 


Asn 


Gly 






275 










280 










285 








Arg 


Val 


Tyr 


Phe 


Val 


Asn 


His 


Asn 


Thr 


Arg 


He 


Thr 


Gin 


Trp 


Glu 


Asp 




290 










295 










300 










Pro 


Arg 


Ser 


Gin 


Gly Gin 


Leu 


Asn 


Glu 


Lys 


Pro 


Leu 


Pro 


Glu 


Gly 


Trp 


305 










310 










315 










320 


Glu 


Met 


Arg 


Phe 


Thr 


Val 


Asp 


Gly 


He 


Pro 


Tyr 


Phe 


Val 


Asp 


His 


Asn 










325 










330 










335 




Arg 


Arg 


Thr 


Thr 


Thr 


Tyr 


He 


Asp 


Pro 


Arg 


Thr 


Gly Lys 


Ser 


Ala 


Leu 








340 










345 










350 






Asp 


Asn 


Gly 


Pro 


Gin 


He 


Ala 


Tyr 


Val 


Arg 


Asp 


Phe 


Lys 


Ala 


Lys 


Val 






355 










360 










365 








Gin 


Tyr 


Phe 


Arg 


Phe 


Trp 


Cys 


Gin 


Gin 


Leu 


Ala 


Met 


Pro 


Gin 


His 


He 




370 










375 










380 










Lys 


He 


Thr 


Val 


Thr 


Arg 


Lys 


Thr 


Leu 


Phe 


Glu 


Asp 


Ser 


Phe 


Gin 


Gin 


385 










390 










395 










400 


lie 


Met 


Ser 


Phe 


Ser 


Pro 


Gin 


Asp 


Leu Arg Arg 


Arg 


Leu 


Trp 


Val 


He 










405 










410 










415 




Phe 


Pro 


Gly 


Glu 


Glu Gly 


Leu 


Asp Tyr Gly Gly 


Val 


Ala 


Arg 


Glu 


Trp 








420 










425 










430 






Phe 


Phe 


Leu 


Leu 


Ser 


His 


Glu 


Val 


Leu 


Asn 


Pro 


Met 


Tyr 


Cys 


Leu 


Phe 






435 










440 










445 








Glu 


Tyr 


Ala 


Gly 


Lys 


Asp 


Asn 


Tyr 


Cys 


Leu 


Gin 


He 


Asn 


Pro 


Ala 


Ser 




450 










455 










460 










Tyr 


He 


Asn 


Pro 


Asp 


His 


Leu 


Lys 


Tyr 


Phe 


Arg 


Phe 


He Gly 


Arg 


Phe 
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465 



470 



'475 



480 



lie 


Ala 


Met 


Ala 


Leu 


Phe 


His 


Gly Lys 


Phe 


He 


Asp 


Thr 


Gly 


Phe 


Ser 










485 










. 490 










495 




Leu 


Pro 


Phe 


Tyr 


Lys 


Arg 


He 


Leu 


Asn 


Lys 


Pro 


Val 


Gly Leu 


Lys 


Asp 








500 










505 










510 






Leu 


Glu 


Ser 


He 


Asp 


Pro 


Glu 


Phe 


Tyr 


Asn 


Ser 


Leu 


He 


Trp 


Val 


Lys 






515 










520 










525 








Glu 


Asn 


Asn 


He 


Glu 


Glu 


Cys 


Asp 


Leu 


Glu 


Met 


Tyr 


Phe 


Ser 


Val 


Asp 




530 










535 










540 










i»ys 


Glu 


lie 


Leu 


Gly Glu 


He 


L ys 


Ser 


His 


Asp 


Leu 


Lys 


Pro 


Asn 


Gly 


545 










550 










555 










560 


Gly Asn 


lie 


Leu 


Val 


Thr 


Glu 


Glu 


Asn 


Lys 


Glu 


Glu 


Tyr 


He 


Arg 


Met 










565 










570 










575 




Val 


Ala 


Glu 


Trp 


Arg 


Leu 


Ser Arg Gly Val 


Glu 


Glu 


Gin 


Thr 


Gin 


Ala 








580 










585 










590 






Phe 


Phe 


Glu 


Gly 


Phe 


Asn 


Glu 


He 


Leu 


Pro 


Gin 


Gin 


Tyr 


Leu 


Gin 


Tyr 






595 










600 










605 








Phe 


Asp 


Ala 


Lys 


Glu 


Leu 


Glu 


Val 


Leu 


Leu 


Cys 


Gly Met 


Gin 


Glu 


He 




610 










615 










620 










Asp 


Leu 


Asn 


Asp 


Trp 


Gin 


Arg 


His 


Ala 


He 


Tyr Arg 


His 


Tyr 


Ala 


Arg 


625 










630 










635 










640 


Thr 


Ser 


Lys 


Gin 


He 


Met 


Trp 


Phe 


Trp 


Gin 


Phe 


Val 


Lys 


Glu 


He 


Asp 










645 










650 










655 




Asn Glu Lys 


Arg 


Met 


Arg 


Leu 


Leu 


Gin 


Phe 


Val 


Thr Gly Thr 


Cys 


Arg 








660 










665 










670 






Leu 


Pro 


Val 


Gly 


Gly 


Phe 


Ala 


Asp 


Leu 


Met 


Gly 


Ser 


Asn Gly 


Pro 


Gin 






675 










680 










685 








Lys 


Phe 


Cys 


He 


Glu 


Lys 


Val 


Gly Lys 


Glu 


Asn 


Trp 


Leu 


Pro 


Arg 


Ser 




690 










695 










700 










His 


Thr 


Cys 


Phe 


Asn Arg 


Leu 


Asp 


Leu 


Pro 


Pro 


Tyr 


Lys 


Ser 


Tyr 


Glu 


705 










710 










715 










720 


Gin 


Leu 


Lys 


Glu 


Lys 


Leu 


Leu 


Phe 


Ala 


He 


Glu 


Glu 


Thr Glu 


Gly 


Phe 










725 










730 










735 




Gly Gin Glu 





























<210> 90 
<211> 431 
<212> PRT 

<213> Homo sapiens 
<400> 90 

Gly Pro Pro Pro Thr Arg Ala Leu Pro Leu Pro Gin Ser Leu Pro Pro 

15 10 15 

Asp Phe Arg Leu Glu Pro Thr Ala Pro Ala Leu Ser Pro Arg Ser Ser 

20 25 30 

Phe Ala Ser Ser Ser Ala Ser Asp Ala Ser Lys Pro Ser Ser Pro Arg 

35 . 40 45 

Gly Ser Leu Leu Leu Asp Gly Ala Gly Ala Gly Gly Ala Gly Gly Ser 

5 0 55 60 

Arg Pro Cys Ser Asn Arg Thr Ser Gly He Ser Met Gly Tyr Asp Gin 
65 70 75 80 

Arg His Gly Ser Pro Leu Pro Ala Gly Pro Cys Leu Phe Gly Pro Pro 
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Leu 


Ala 


Gly 


Ala 


Pro 


Ala 


Gly Tyr Ser 








100 








105 


Tyr 


Pro 


Glu 


Leu 


His 


Ala 


Ala 


Leu Asp 






115 










12 0 


Ala 


Gly 


Phe 


Gly 


Cys 


Gin 


Glu 


Ser Arg 




130 










135 




Gly 


Ser 


Pro 


Gly 


Ala 


Leu 


Ala 


Gly Ala 


145 










150 






Leu 


Glu 


Arg 


Arg 


Gly 


Ala 


Gin 


Pro Gly 










165 








Gly 


Asp 


Cys 


Ala 


Val 


Gly Ala Arg Tyr 








180 








185 


Leu 


Arg 


Leu 


Thr 


Val 


Gly Thr 


Gly Gly 






195 










200 


Glu 


Pro 


Ser 


Gly 


He 


Glu 


Pro 


Ser Gly 




210 










215 




Phe 


Val 


Pro 


Glu 


Ala 


Ala 


Arg 


Ala Arg 


225 










230 






Glu 


Asp 


Tyr 


Phe 


Gly 


Thr 


Cys 


He Lys 










245 








Gin 


Ser 


Asn 


Ala 


Cys 


Gin 


Ala 


Leu Asp 








260 








265 


Phe 


Val 


Cys 


Cys 


Ser 


Cys 


Gly Arg Thr 






275 










280 


Ser 


Val 


Asn 


Gly 


Ser 


Val 


Tyr 


Cys Glu 




290 










295 




Phe 


Gin 


Glu 


Ala 


Ala 


Glu 


Lys 


Cys Cys 


305 










310 






Glu 


Lys 


lie 


Leu 


Gin 


Ala 


Met 


Gly Lys 










325 








Arg 


Cys 


He 


Val 


Cys 


Asn 


Lys 


Cys Leu 








340 








345 


Asp 


Phe 


Ser 


Asn 


Gin 


Val 


Tyr 


Cys Val 






355 










360 


Ala 


Pro 


Lys 


Cys 


Ala 


Ala 


Cys 


Gly Gin 




370 










375 




Cys 


Glu 


Asp 


He 


Val 


Arg Val 


He Ser 


385 










390 






Glu 


Cys 


Tyr 


His 


Cys 


Glu 


Asp 


Cys Arg 










405 








Gly 


Cys 


Cys 


Cys 


Phe 


Pro 


Leu 


Asp Gly 



420 425 
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90 








95 




Pro Gly Gly Val 


Pro 


Ser 


Ala 








110 






Arg Leu Tyr Ala 


Gin 


Arg 


Pro 






125 








His Ser 


Tyr 


Pro 


Pro 


Ala 


Leu 




140 










Arg Val 


Gly Ala 


Ala 


Gly 


Pro 


155 










160 


Arg His 


Ser 


Val 


Thr 


Gly 


Tyr 


170 








175 




Gin Asp Glu Leu 


Thr 


Ala 


Leu 








190 






Arg Glu Ala Gly 


Ala 


Arg 


Gly 






205 








Leu Glu 


Glu 


Pro 


Pro 


Gly 


Pro 




220 










Met Arg 


Glu 


Pro 


Glu 


Ala 


Arg 


235 










240 


Cys Asn 


Lys 


Gly 


He 


Tyr 


Gly 


250 








255 




Ser Leu 


Tyr 


His 


Thr 


Gin 


Cys 








270 






Leu Arg 


Cys 


Lys 


Ala 


Phe 


Tyr 






285 








Glu Asp 


Tyr 


Leu 


Phe 


Ser 


Gly 




300 










Val Cys 


Gly His 


Leu 


He 


Leu 


315 










320 


Ser Tyr His 


Pro 


Gly 


Cys 


Phe 


330 








335 




Asp Gly He 


Pro 


Phe 


Thr 


Val 








350 






Thr Asp Tyr His 


Lys 


Asn 


Tyr 






365 








Pro He 


Leu 


Pro 


Ser 


Glu 


Gly 




380 










Met Asp 


Arg Asp 


Tyr 


His 


Phe 


395 










400 


Met Gin 


Leu 


Ser 


Asp 


Glu 


Glu 


410 








415 




His Leu 


Leu 


Cys 


His 


Gly 





430 



<400> 91 

Gly Pro Gly Ser Arg His His Arg Ala Arg Asp Arg Leu He His Phe 

1-5 10 15 * 

Gly Ala Val Ser Thr Asp Val Leu Gly Cys Ser Ala His Cys Ser Leu 
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20 








25 








30 






Thr 


Gin 


Ser 


Pro 


Lys 


Met Asn 


He 


Gin 


Glu 


Gin Gly Phe Pro 


Leu 


Asp 






35 








40 










45 






Leu Gly 


Ala 


Ser 


Phe 


Thr Glu Asp 


Ala 


Pro 


Arg 


Pro 


Pro Val 


Pro 


Gly 




50 








55 










60 








Glu 


Glu 


Gly 


Glu 


Leu 


Val Ser 


Thr 


Asp 


Pro 


Arg 


Pro 


Ala Ser 


Tyr 


Ser 


65 










70 








75 








80 


Phe 


Cys 


Ser 


Gly 


Lys 


Gly Val 


Gly 


He 


Lys 


Gly Glu Thr Ser 


Thr 


Ala 










85 








90 








95 




Thr 


Pro 


Arg 


Arg 


Ser 


Asp Leu Asp 


Leu 


Gly 


Tyr 


Glu 


Pro Glu 


Gly Ser 








100 








105 








110 






Ala 


Ser 


Pro 


Thr 


Pro 


Pro Tyr 


Leu 


Lys 


Trp 


Ala 


Glu 


Ser Leu 


His 


Ser 






115 








120 










125 






Leu 


Leu 


Asp 


Asp 


Gin 


Asp Gly 


He 


Ser 


Leu 


Phe 


Arg 


Thr Phe 


Leu 


Lys 




130 








135 










140 








Gin 


Glu 


Gly 


Cys 


Ala 


Asp Leu 


Leu 


Asp 


Phe 


Trp 


Phe 


Ala Cys 


Thr 


Gly 


145 










150 








155 








160 


Phe 


Arg 


Lys 


Leu 


Glu 


Pro Cys 


Asp 


Ser 


Asn 


Glu 


Glu 


Lys Arg 


Leu 


Lys 










165 








170 








175 




Leu 


Ala 


Arg 


Ala 


lie 


Tyr Arg 


Lys 


Tyr 


He 


Leu 


Asp 


Asn Asn 


Gly 


lie 








180 








185 








190 






Val 


Ser 


Arg 


Gin 


Thr 


Lys Pro 


Ala 


Thr 


Lys 


Ser 


Phe 


He Lys 


Gly Cys 






195 








200 










205 






lie 


Met 


Lys 


Gin 


Leu 


He Asp 


Pro 


Ala 


Met 


Phe 


Asp 


Gin Ala 


Gin 


Thr 




210 








215 










220 








Glu 


He 


Gin 


Ala 


Thr 


Met Glu 


Glu 


Asn 


Thr 


Tyr 


Pro 


Ser Phe 


Leu 


Lys 


225 










230 








235 








240 


Ser Asp 


He 


Tyr 


Leu 


Glu Tyr 


Thr 


Arg 


Thr 


Gly 


Ser 


Glu Ser 


Pro 


Lys 










245 








250 








255 




Val 


Cys 


Ser 


Asp 


Gin 


Ser Ser Gly 


Ser Gly 


Thr Gly Lys Gly 


He 


Ser 








260 








265 








270 






Gly Tyr 


Leu 


Pro 


Thr 


Leu Asn 


Glu 


Asp 


Glu 


Glu 


Trp- Lys Cys 


Asp 


Gin 






275 








280 










285 






Asp 


Met 


Asp 


Glu 


Asp 


Asp Gly Arg 


Asp 


Ala 


Ala 


Pro 


Pro Gly 


Arg 


Leu 




290 








295 










300 








Pro 


Gin 


Lys 


Leu 


Leu 


Leu Glu 


Thr 


Ala 


Ala 


Pro 


Arg 


Val Ser 


Ser 


Ser 


305 










310 








315 








320 


Arg Arg 


Tyr 


Ser 


Glu 


Gly Arg Glu 


Phe 


Arg 


Tyr Gly Ser Trp 


Arg 


Glu 










325 








330 








335 




Pro 


Val 


Asn 


Pro 


Tyr 


Tyr Val Asn 


Ala 


Gly 


Tyr Ala Leu Ala 


Pro 


Ala 








340 








345 








350 






Thr 


Ser 


Ala 


Asn 


Asp 


Ser Glu 


Gin 


Gin 


Ser 


Leu 


Ser 


Ser Asp 


Ala 


Asp 






355 








360 










365 






Thr 


Leu 


Ser 


Leu 


Thr 


Asp Ser 


Ser 


Val 


Asp 


Gly 


He 


Pro Pro 


Tyr Arg 




370 








375 










380 








lie 


Arg 


Lys 


Gin 


His 


Arg Arg Glu 


Met 


Gin 


Glu 


Ser 


Ala Gin 


Val 


Asn 


385 










390 








395 








400 


Gly Arg 


Val 


Pro 


Leu 


Pro His 


He 


Pro 


Arg 


Thr Tyr Arg Val 


Pro 


Lys 










405 








410 








415 




Glu 


Val 


Arg 


Val 


Glu 


Pro Gin 


Lys 


Phe 


Ala 


Glu 


Glu 


Leu He 


His 


Arg 








420 








425 








430 






Leu 


Glu 


Ala 


Val 


Gin 


Arg Thr Arg 


Glu 


Ala 


Glu 


Glu 


Lys Leu 


Glu 


Glu 






435 








440 










445 
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Arg 


Leu 


Lys 


Arg 


Val 


Arg 


Met 


Glu 


Glu Glu 


Gly Glu Asp Gly 


Asp 


Pro 




450 










455 








460 








Ser 


Ser 


Gly 


Pro 


Pro 


Gly 


Pro 


Cys 


His Lys 


Leu 


Pro 


Pro Ala 


Pro 


Ala 


465 










470 








475 








480 


Trp 


His 


His 


Phe 


Pro 


Pro 


Arg 


Leu 


Cys Trp 


Thr 


Trp 


Ala Cys 


Ala 


Gly 










485 








490 








495 




Leu 


Arg 


Asp 


Ala 


His 


Glu 


Glu 


Asn 


Pro Glu 


Ser 


He 


Leu Asp 


Glu 


His 








500 










505 






510 






Val 


Gin 


Arg 


Val 


Leu 


Arg 


Thr 


Thr 


Gly Arg 


Gin 


Ser 


Pro Gly 


Pro 


Gly 






515 










520 








525 






His 


Arg 


Ser 


Pro 


Asp 


Ser 


Gly 


His 


Val Ala 


Lys 


Met 


Pro Val 


Ala 


Leu 




530 










535 








540 








Gly 


Gly 


Ala 


Ala 


Ser Gly 


His 


Gly Lys His 


Val 


Pro 


Lys Ser 


Gly 


Ala 


545 










550 








555 






560 


Lys 


Leu 


Asp 


Ala 


Ala Gly 


Leu 


His 


His His 


Arg 


His 


Val His 


His 


His 










565 








570 








575 




Val 


His 


His 


Ser 


Thr 


Ala 


Arg 


Pro 


Lys Glu 


Gin 


Val 


Glu Ala 


Glu 


Ala 








580 










585 . 






590 






Thr 


Arg 


Arg 


Ala 


Gin 


Ser 


Ser 


Phe 


Ala Trp 


Gly Leu Glu Pro 


His 


Ser 






595 










600 








605 






His 


Gly 


Ala 


Arg 


Ser 


Arg 


Gly 


Tyr 


Ser Glu 


Ser 


Val 


Gly Ala 


Ala 


Pro 




610 










615 








620 








Asn 


Ala 


Ser 


Asp 


Gly Leu 


Ala 


His 


Ser Gly 


Lys 


Val 


Gly Val 


Ala 


Cys 


625 










630 








635 








640 


Lys 


Arg 


Asn 


Ala 


Lys 


Lys 


Ala 


Glu 


Ser Gly 


Lys 


Ser 


Ala Ser 


Thr 


Glu 










645 








650 








655 




Val 


Pro 


Gly 


Ala 


Ser 


Glu 


Asp 


Ala 


Glu Lys 


Asn 


Gin 


Lys He 


Met 


Gin 








660 










665 






670 






Trp 


lie 


He 


Glu 


Gly Glu 


Lys 


Glu 


He Ser 


Arg 


His 


Arg Arg 


Thr 


Gly 






675 










680 








685 






His 


Gly 


Ser 


Ser 


Gly Thr 


Arg 


Lys 


Pro Gin 


Pro 


His 


Glu Asn- 


Ser 


Arg 




690 










695 








700 








Pro 


Leu 


Ser 


Leu 


Glu 


His 


Pro 


Trp Ala Gly 


Pro 


Gin 


Leu Arg 


Thr 


Ser 


705 










710 








715 








720 


Val 


Gin 


Pro 


Ser 


His 


Leu 


Phe 


He 


Gin Asp 


Pro 


Thr 


Met Pro 


Pro 


His 










725 








730 








735 




Pro 


Ala 


Pro 


Asn 


Pro 


Leu 


Thr 


Gin 


Leu Glu 


Glu Ala Arg Arg 


Arg 


Leu 








740 










745 






750 






Glu 


Glu 


Glu 


Glu 


Lys 


Arg 


Ala 


Ser Arg Ala 


Pro 


Ser 


Lys Gin 


Arg 


Tyr 






755 










760 








765 






Val 


Gin 


Glu 


Val 


Met 


Arg 


Arg 


Gly Arg Ala 


Cys 


Val 


Arg Pro 


Ala 


Cys 




770 










775 








780 








Ala 


Pro 


Val 


Leu 


His 


Val 


Val 


Pro 


Ala Val 


Ser 


Asp 


Met Glu 


Leu 


Ser 


785 










790 








795 








800 


Glu 


Thr 


Glu 


Thr 


Arg 


Ser 


Gin 


Arg 


Lys Val 


Gly Gly Gly Ser 


Ala 


Gin 










805 








810 








815 




Pro 


Cys 


Asp 


Ser 


He 


Val 


Val 


Ala 


Tyr Tyr 


Phe 


Cys 


Gly Glu 


Pro 


He 








820 










825 






830 






Pro 


Tyr 


Arg 


Thr 


Leu 


Val 


Arg 


Gly Arg Ala 


Val 


Thr 


Leu Gly 


Gin 


Phe 






835 










840 








845 






Lys 


Glu 


Leu 


Leu 


Thr 


Lys 


Lys 


Gly 


Ser Tyr 


Arg 


Tyr 


Tyr Phe 


Lys 


Lys 




850 










855 








860 








Val 


Ser 


Asp 


Glu 


Phe Asp 


cys 


Gly Val Val 


Phe 


Glu 


Glu Val 


Arg 


Glu 
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865 



870 



875 



880 



Asp Glu Ala Val Leu Pro Val Phe Glu Glu Lys lie lie Gly Lys Val 
885 890 895 

Glu Lys Val Asp 



<210> 92 
<211> 591 
<212> PRT 

<213> Homo sapiens 
<400> 92 

Met Val Pro Val Ala Val Thr Ala Ala Val Ala Pro Val Leu Ser lie 

15 10 15 

Asn Ser Asp Phe Ser Asp Leu Arg Glu lie Lys Lys Gin Leu Leu Leu 

20 25 30 

lie Ala Gly Leu Thr Arg Glu Arg Gly Leu Leu His Ser Ser Lys Trp 

3 5 4 0 45 

Ser Ala Glu Leu Ala Phe Ser Leu Pro Ala Leu Pro Leu Ala Glu Leu 

50 55 60 

Gin Pro Pro Pro Pro lie Thr Glu Glu Asp Ala Gin Asp Met Asp Ala 
65 70 75 80 

Tyr Thr Leu Ala. Lys Ala Tyr Phe Asp Val Lys Glu Tyr Asp Arg Ala 

85 90 95 

Ala His Phe Leu His Gly Cys Asn Ser Lys Lys Ala Tyr Phe Leu Tyr 

100 105 110 

Met Tyr Ser Arg Tyr Leu Ser Gly Glu Lys Lys Lys Asp Asp Glu Thr 

115 120 125 

Val Asp Ser Leu Gly Pro Leu Glu Lys Gly Gin Val Lys Asn Glu Ala 

130 135 140 

Leu Arg Glu Leu Arg Val Glu Leu Ser Lys Lys His Gin Ala Arg Glu 
145 150 155 160 

Leu Asp Gly Phe Gly Leu Tyr Leu Tyr Gly Val Val Leu Arg Lys Leu 

165 170 175 

Asp Leu Val Lys Glu Ala lie Asp Val Phe Val Glu Ala Thr His Val 

180 185 190 

Leu Pro Leu His Trp Gly Ala Trp Leu Glu Leu Cys Asn Leu lie Thr 

195 200 205 

Asp Lys Glu Met Leu Lys Phe Leu Ser Leu Pro Asp Thr Trp Met Lys 

210 215 220 

Glu Phe Phe Leu Ala His lie Tyr Thr Glu Leu Gin Leu lie Glu Glu 
225 230 235 240 

Ala Leu Gin Lys Tyr Gin Asn Leu lie Asp Val Gly Phe Ser Lys Ser 

245 250 255 

Ser Tyr He Val Ser Gin He Ala Val Ala Tyr His Asn He Arg Asp 

260 265 270 

lie Asp Lys Ala Leu Ser He Phe Asn Glu Leu Arg Lys Gin Asp Pro 

275 280 285 

Tyr Arg He Glu Asn Met Asp Thr Phe Ser Asn Leu Leu Tyr Val Arg 

290 295 300 

Ser Met Lys Ser Glu Leu Ser Tyr Leu Ala His Asn Leu Cys Glu He 
305 310 315 320 

Asp Lys Tyr Arg Val Glu Thr Cys Cys Val He Gly Asn Tyr Tyr Ser 



900 
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325 










330 










335 




Leu 


Arg 


Ser 


Gin 
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Glu 


Lys 
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Ala 


Leu 


Tyr 
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Gin Arg Ala 


Leu 








340 










345 










350 






Lys 


Leu 


Asn 


Pro 


Arg 


Tyr 


Leu 


Gly Ala 


Trp 


Thr 


Leu 


Met 
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Glu 
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Tyr 


Met 


Glu 


Met 
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Asn 


Thr 


Ser 
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Tyr Arg 
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He 


Glu 
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Lys 


Arg 


Asp 


Tyr 


Arg 


Ala 


Trp 


Tyr 


Gly Leu 


Gly 
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395 










400 
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Thr 


Tyr 


Glu 


He 
405 


Leu 


Lys 


Met 


Pro 


Phe 
410 


Tyr 
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Tyr 
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Tyr 


Arg 


Arg 


Ala 
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Gin 


Leu Arg 
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Asp 
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Met 
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Val 


Ala 
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Leu 


Gly 


Glu 
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Cys 


Tyr 


Glu 


Lys 


Leu 
440 
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Gin 


Leu 


Val 


Glu 
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Lys 
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Tyr 


Trp 
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Tyr 


Ala 


Val 


Gly Asp 


Val 


Glu 


Lys 


Met 


Ala 


Leu 
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Val 


Lys 


Leu 


Ala 


Lys 


Leu 
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Glu 
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Leu 


Thr 


Glu 
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Glu 


Gin 


Ala 
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470 










475 










480 


Ala 


Gin 
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485 


Lys 


Tyr 


He 
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Asp 
490 


He 


Tyr 


Ser 


Cys 
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495 


Glu 
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Val 


Glu 
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Leu 


Glu 
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Thr 
505 


Ala 


Phe 


Arg 


Tyr 


Leu 
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Ala 
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Tyr 


Tyr 


Phe 
515 


Lys 


Cys 


Lys 


Leu 


Trp 
520 


Asp 


Glu 


Ala 


Ser 


Thr 
525 


Cys 


Ala 


Gin 


Lys 


Cys 


Cys 


Ala 


Phe 


Asn Asp 


Thr Arg Glu 


Glu Gly Lys Ala Leu 


Leu 




530 










535 










540 










Arg 


Gin 


He 


Leu 


Gin 


Leu 


Arg 


Asn Gin Gly 


Glu 


Thr 


Pro 


Thr 


Thr 


Glu 


545 










550 










555 










560 


Val 


Pro 


Ala 


Pro 


Phe 
565 


Phe 


Leu 


Pro 


Ala 


Ser 
570 


Leu 


Ser 


Ala 


Asn 


Asn 
575 


Thr 


Pro 


Thr 


Arg 


Arg 
580 


Val 


Ser 


Pro 


Leu 


Asn 
585 


Leu 


Ser 


Ser 


Val 


Thr 
590 


Pro 





<210> 93 
<211> 914 
<212> PRT 

<213> Homo sapiens 
<400> 93 

Val Tyr Gin Val Leu Leu Val Gly 

1 5 
Ser Gly Leu Gin Leu Glu Gin Leu 
20 

He Pro Thr Ala Gly Leu Pro Thr 

35 40 
Cys His Gin Ala Trp Leu Asp Phe 

50 55 
Gin Asn Cys Gin Ala Ala Cys Ala 
65 70 
Val Lys Ala Val Pro Gin Pro Met 
85 

Leu Gin Gin Thr Glu Val Leu Met 



Ser Thr Leu Leu Lys Glu Val Pro 

10 15 
Pro Ser Gin Ser Leu Leu Thr His 
25 30 
Ser Leu Gly Gly Gly Leu Pro Tyr 
45 

Arg Arg Arg Leu Glu Ala Leu Leu 
60 

Leu Leu Gin Gly Ala He Glu Ser 

75 80 
Glu Pro Gly Glu Val Gly Gin Leu 

90 95 
Gin Gin Val Leu Asp Ser Pro Trp 
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100 105 110 

Leu Ala Trp Leu Gin Cys Gin Gly Gly Arg Glu Leu Thr Trp Leu Lys 

115 120 125 

Gin Glu Val Pro Glu Val Thr Leu Ser Pro Asp Tyr Arg Thr Ala Met 
130 135 140 

Asp Lys Ala Asp Glu Leu Tyr Asp Arg Val Asp Gly Leu Leu His Gin 
145 150 155 160 

Leu Thr Leu Gin Ser Asn Gin Arg lie Gin Ala Leu Glu Leu Val Gin 

165 170 175 

Thr Leu Glu Ala Arg Glu Ser Gly Leu His Gin lie Glu Val Trp Leu 

180 185 190 

Gin Gin Val Gly Trp Pro Ala Leu Glu Glu Ala Gly Glu Pro Ser Leu 

195 200 205 

Asp Met Leu Leu Gin Ala Gin Gly Ser Phe Gin Glu Leu Tyr Gin Val 

210 215 220 

Ala Gin Glu Gin Val Arg Gin Gly Glu Lys Phe Leu Gin Pro Leu Thr 
225 230 235 240 

Gly Trp Glu Ala Ala Glu Leu Asp Pro Pro Gly Ala Arg Phe Leu Ala 

245 250 255 

Leu Arg Ala Gin Leu Thr Glu Phe Ser Arg Ala Leu Ala Gin Arg Cys 

260 265 270 

Gin Arg Leu Ala Asp Ala Glu Arg Leu Phe Gin Leu Phe Arg Glu Ala 

275 280 285 

Leu Thr Trp Ala Glu Glu Gly Gin Arg Val Leu Ala Glu Leu Glu Gin 

290 295 300 

Glu Arg Pro Gly Val Val Leu Gin Gin Leu Gin Leu His Trp Thr Arg 
305 310 315 320 

His Pro Asp Leu Pro Pro Ala His Phe Arg Lys Met Trp Ala Leu Ala 

325 330 335 

Thr Gly Leu Gly Ser Glu Ala lie Arg Gin Glu Cys Arg Trp Ala Trp 

340 345 350 

Ala Arg Cys Gin Asp Thr Trp Leu Ala Leu Asp Gin Lys Leu Glu Ala 

355 360 365 

Ser Leu Lys. Leu Pro Pro Val Gly Ser Thr Ala Ser Leu Cys Val Ser 

370 375 380 

Gin Val Pro Ala Ala Pro Ala His Pro Pro Leu Arg Lys Ala Tyr Ser 
385 390 395 " 400 

Phe Asp Arg Asn Leu Gly Gin Ser Leu Ser Glu Pro Ala Cys His Cys 

405 410 415 

His His Ala Ala Thr He Ala Ala Cys Arg Arg Pro Glu Ala Gly Gly 

420 425 430 

Gly Ala Leu Pro Gin Ala Ser Pro Thr Val Pro Pro Pro Gly Ser Ser 

435 440 445 

Asp Pro Arg Ser Leu Asn Arg Leu Gin Leu 'Val Leu Ala Glu Met Val 

450 455 460 

Ala Thr Glu Arg Glu Tyr Val Arg Ala Leu Glu Tyr Thr Met Glu Asn 
465 470 475 480 

Tyr Phe Pro Glu Leu Asp Arg Pro Asp Val Pro Gin Gly Leu Arg Gly 

485 490 495 

Gin Arg Ala His Leu Phe Gly Asn Leu Glu Lys Leu Arg Asp Phe His 

500 505 510 

Cys His Phe Phe Leu Arg Glu Leu Glu Ala Cys Thr Arg His Pro Pro 
515 520 525 



BNSDOCID: <WO 0177327A1_I_> 



WO 01/77327 PCT/US00/I6951 

243 



Arg 


Val 


Ala 


Tyr 


Ala 


Phe 


Leu 


Arg 




530 










535 




Ala 


Leu 


Tyr 


Ser 


Lys 


Asn 


Lys 


Pro 


545 










550 






Tyr 


Gly 


His 


Thr 


Phe 


Phe 


Lys 


Asp 










565 








Leu 


Asp 


Leu 


Ala 


Ser 


Tyr 


Leu 


Leu 








580 










Tyr 


Ala 


Leu 


Leu 


Leu 


Gin 


Glu 


Leu 






595 










600 


Gin 


Glu 


Leu 


Ser 


Ala 


Leu 


Arg 


Glu 




610 










615 




Leu 


Arg 


His 


Gly 


Asn 


Asp 


Leu 


Leu 


625 










630 






Asp 


Val 


Asn 


Leu 


Lys 


Glu 


Gin 


Gly 










645 








Val 


Val 


Arg 


Thr 


Gly Arg 


His 


Lys 








660 










Glu 


Glu 


Leu 


Leu 


Leu 


Phe 


Ser 


Lys 






675 










680 


Asp 


Thr 


Phe 


Ala 


Tyr 


Lys 


Arg 


Ser 




690 










695 




Thr 


Glu 


Cys 


Cys 


Gly Asn 


Ser 


Asn 


705 










710 






Arg 


Arg 


Lys 


Ala 


Arg 


Asp 


Thr 


Phe 










725 








lie 


Lys 


Gin 


Ala 


Trp 


Thr 


Ala 


Asp 








740 










Ala 


Val 


His 


Asn 


Lys 


Glu 


Val 


Arg 






755 










760 


Val 


Gly 


Asn 


Lys 


Ala 


Phe 


Arg 


Asp 




770 










775 




Asn 


Asp 


Arg 


Thr 


Val 


Asn 


Tyr 


Val 


785 










790 






Arg 


Ala 


Ser 


lie 


Ala 


Val 


Ala 


Pro 










805 








Gly 


Ala 


Ser 


Asn 


Ser 


Leu 


Pro 


Gly 








820 










Gly 


Ser 


Leu 


Asn 


Leu 


His 


Leu 


Tyr 






835 










84 0 


Arg 


Cys 


Pro 


Leu 


Tyr 


Pro 


Ser 


Phe 




850 










855 




Glu 


Ala 


Glu 


Leu 


Gly 


Gly Gin 


Pro 


865 










870 






lie 


Ser 


Ser 


Gin 


Cys 


Pro 


Ser 


Ala 










885 








Ser 


Cys 


Val 


Ser 


Gly Gin Ala 


Leu 



900 



Cys Val 



His 


Arg Val 


Gin 


Phe 


Gly Met 


Tyr 








540 










Arg 


Ser Asp 


Ala 


Leu 


Met 


Ser 


Ser 






555 










560 


Lys 


Gin 


Gin 


Ala 


Leu 


Gly Asp 


His 




570 










575 




Lys 


Pro 


lie 


Gin Arg 


Met 


Gly 


Lys 


585 










590 






Ala 


Arg 


Ala 


Cys 


Gly 


Gly 


Pro 


Thr 










605 








Ala 


Gin 


Ser 


Leu 


Val 


His 


Phe 


Gin 








620 










Ala 


Met 


Asp 


Ala 


lie 


Gin 


Gly 


Cys 






635 










640 


Gin 


Leu 


Val 


Arg 


Gin 


Asp 


Glu 


Phe 




650 










655 




Ser 


Val 


Arg 


Arg 


He 


Phe 


Leu 


Phe 


665 










670 






Pro Arg His 


Gly Pro 


Thr Gly 


Val 










685 








Phe 


Lys 


Met 


Ala 


Asp 


Leu Gly 


Leu 








700 










Leu Arg 


Phe 


Glu 


He 


Trp 


Phe 


Arg 






715 










720 


Val 


Leu 


Gin 


Ala 


Ser 


Ser 


Leu 


Ala 




730 










735 




lie 


Ser 


His 


Leu 


Leu 


Trp 


Arg 


Gin 


745 










750 






Met 


Ala 


Glu 


Met 


Val 


Ser 


Met 


Gly 










765 








lie 


Ala 


Pro 


Ser 


Glu 


Glu 


Ala 


He 








780 










Leu 


Lys 


Cys 


Arg 


Glu 


Val 


Arg 


Ser 






795 










800 


Phe 


Asp 


His 


Asp 


Ser 


Leu 


Tyr 


Leu 




810 










815 




Asp 


Pro 


Ala 


Ser 


Cys 


Ser 


Val 


Leu 


825 










830 






Arg 


Asp 


Pro 


Ala 


Leu 


Leu 


Gly 


Leu 










845 








Leu 


Glu 


Glu 


Ala 


Ala 


Leu 


Glu 


Ala 








860 










Ser 


Leu 


Thr 


Ala 


Glu 


Asp 


Ser 


Glu 






875 










880 


Ser Gly Ser Ser Gly 


Ser 


Asp 


Ser 




890 










895 




Gly Arg Gly Leu Glu 


Asp 


Leu 


Pro 


905 










910 







<210> 94 
<211> 277 
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<212> PRT 

<213> Homo sapiens 



<400> 94 



Leu Asn 


Tyr 


Leu 


Leu 


Glu Ser Arg Leu Glu Ala 


Ala 


Ala 


His 


Cys 


Ala 


1 






5 


10 








15 




Leu Lys 


Gin 


Gly 


He 


Ala Thr Ala Ser Leu Leu 


Pro 


Ala 


Gin 


Leu 


Gin 






20 




25 






30 






Pro Ala 


Val 


Leu 


Thr 


Val Val Thr Cys His Val 


Val 


Val 


Ser 


Val 


His 




35 






40 




45 








Gly His 


His 


Thr 


Asp 


Gly Cys Leu Ala Ala Leu 


Cys 


Arg 


Glu 


Asp 


Arg 


50 








55 


60 










Thr Gly Thr 


Gly 


Gly 


Ala Phe Trp Cys Lys- Asn 


Arg 


Val 


He 


Val 


Ser 


65 








70 75 










80 


His Ala 


Val 


Asp 


Val 


Val Leu His Val His Gly 


Glu 


Gly 


Asn 


Pro 


Val 








85 


90 








95 




Gin Ala 


Leu 


He 


Ala 


His Gly Ala Pro Glu Ala 


Ala 


Trp 


Val 


Val 


Gly 






100 




105 






110 






Leu Ala 


Gin 


Gly 


Leu 


Gin Asp- His Phe His Asp 


Glu 


Met 


Ser 


Thr 


His 




115 






12 0 




125 








Ala Ala 


Phe 


Val 


Gly 


Arg Leu Leu Glu Pro Gly 


Val 


Gin 


Glu 


Val 


Leu 


130 








135 


140 










Leu Ala 


Val 


His. 


Phe 


Leu Thr His Val Val Glu 


Arg 


Leu 


Pro 


Thr 


Glu 


145 








150 155 










160 


Ser Ser 


Pro 


Thr 


Arg 


Val Ala Gly Glu Ala Val 


Ser 


Val 


He 


Lys 


Thr 








165 


170 








175 




Pro His 


Cys 


Leu 


Ala 


Arg Leu Leu Gly Ser Val 


Asp 


Ala 


Lys 


Pro 


Thr 






180 




185 






190 






Leu Asp 


Ala 


Asn 


Ala 


Glu Val Val Pro Arg Arg 


Ala 


Arg 


Leu 


Glu Arg 




195 






200 




205 








Pro Leu 


Gin 


Leu 


Pro 


Gly Glu Arg Leu Gin Pro 


Pro 


Leu 


Gly 


Arg 


Ala 


210 








215 


220 










Trp Ala 


Ala 


Leu 


Pro 


Ala Arg Gly Gin Arg Glu 


Cys 


Arg 


Gin 


Arg 


Glu 


225 








230 235 










240 


Gly Gly Arg 


Pro 


Arg 


Arg Leu Arg Gly Ala Ser 


Gly Arg 


Gly 


Ala Gly 








245 


250 








255 




Ala Gly Arg 


Glu 


Glu 


Val Ser Val Gly Phe Ser 


Ala 


Gin 


Trp 


Glu 


Phe 






260 




265 






270. 






Gly Ser 


Gly 


Arg 


His 
















275 



















<210> 95 
<211> 1120 
<212> PRT 

<213> Homo sapiens 



<400> 95 

Met Trp Arg Val Lys Lys Leu Ser Leu Ser Leu Ser Pro Ser Pro Gin 

15 10 15 

Thr Gly Lys Pro Ser Met Arg Thr Pro Leu Arg Glu Leu Thr Leu Gin 

20 25 30 

Pro Gly Ala Leu Thr Thr Ser Gly Lys Arg Ser Pro Ala Cys Ser Ser 
35 40 45 
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Leu Thr Pro Ser Leu Cys Lys Leu Gly Leu Gin Glu Gly Ser Asn Asn 

50 55 60 

Ser Ser Pro Val Asp Phe Val Asn Asn Lys Arg Thr Asp Leu Ser Ser 
65 70 75 80 

Glu His Phe Ser His Ser Ser Lys Trp Leu Glu Thr Cys Gin His Glu 

85 90 95 

Ser Asp Glu Gin Pro Leu Asp Pro lie Pro Gin lie Ser Ser Thr Pro 

100 105 110 

Lys Thr Ser Glu Glu Ala Val Asp Pro Leu Gly Asn Tyr Met Val Lys 

115 120 125 

Thr lie Val Leu Val Pro Ser Pro Leu Gly Gin Gin Gin Asp Met lie 

130 135 140 

Phe Glu Ala Arg Leu Asp Thr Met Ala Glu Thr Asn Ser lie Ser Leu 
145 150 155 160 

Asn Gly Pro Leu Arg Thr Asp Asp Leu Val Arg Glu Glu Val Ala Pro 

165 170 175 

Cys Met Gly Asp Arg Phe Ser Glu Val Ala Ala Val . Ser Glu Lys Pro 

180 185 190 

lie Phe Gin Glu Ser Pro Ser His Leu Leu Glu Glu Ser Pro Pro Asn 

195 200 205 

Pro Cys Ser Glu Gin Leu His Cys Ser Lys Glu Ser Leu Ser Ser Arg 

210 215 220 

Thr Glu Ala Val Arg Glu Asp Leu Val Pro Ser Glu Ser Asn Ala Phe 
225 230 235 240 

Leu Pro Ser Ser Val Leu Trp Leu Ser Pro Ser Thr Ala Leu Ala Ala 

245 250 255 

Asp Phe Arg Val Asn His Val Asp Pro Glu Glu Glu lie -Val Glu His 

260 265 270 

Gly Ala Met Glu Glu Arg Glu Met Arg Phe Pro Thr His Pro Lys Glu 

275 280 285 

Ser Glu Thr Glu Asp Gin Ala Leu Val Ser Ser Val Glu Asp lie Leu 

290 295 300 

Ser Thr Cys Leu Thr Pro Asn Leu. Val Glu Met Glu Ser Gin Glu Ala 
305 310 315 320 

Pro Gly Pro Ala Val Glu Asp Val Gly Arg lie Leu Gly Ser Asp Thr 

325 330 335 

Glu Ser Trp Met Ser Pro Leu Ala Trp Leu Glu Lys Gly Val Asn Thr 

340 345 . 350 

Ser Val Met Leu Glu Asn Leu Arg Gin Ser Leu Ser Leu Pro Ser Met 

355 360 365 

Leu Arg Asp Ala Ala lie Gly Thr Thr Pro Phe Ser Thr Cys Ser Val 

370 375 380 

Gly Thr Trp Phe Thr Pro Ser Ala Pro Gin Glu Lys Ser Thr Asn Thr 
385 390 395 400 

Ser Gin Thr Gly Leu Val Gly Thr Lys His Ser Thr Ser Glu Thr Glu 

405 410 415 

Gin Leu Leu Cys Gly Arg Pro Pro Asp Leu Thr Ala Leu Ser Arg His 

420 425 430 

Asp Leu Glu Asp Asn Leu Leu Ser Ser Leu Val lie Val Glu Phe Leu 

435 440 445 

Ser Arg Gin Leu Arg Asp Trp Lys Ser Gin Leu Ala Val Pro His Pro 

450 455 460 

Glu Thr Gin Asp Ser Ser Thr Gin Thr Asp Thr Ser His Ser Gly lie 
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465 










470 










475 










480 


Thr 


Asn 


Lys 


Leu 


Gin 


His 


Leu 


Lys 


Glu 


Ser 


His 


Glu 


Met 


Gly 


Gin 


Ala 










485 










490 










495 




Leu 


Gin 


Gin 


Ala 


Arg 


Asn 


Val 


Met 


Gin 


Ser 


Trp 


Val 


Leu 


He 


Ser 


Lys 








500 










505 










510 






Glu 


Leu 


He 


Ser 


Leu 


Leu 


His 


Leu 


Ser 


Leu 


Leu 


His 


Leu 


Glu 


Glu 


Asp 






515 










520 










525 








Lys 


Thr 


Thr 


Val 


Asn 


Gin 


Glu 


Ser Arg Arg Ala 


Glu 


Thr 


Leu 


Val 


Cys 




530 










535 










540 










Cys 


Cys 


Phe 


Asp 


Leu 


Leu 


Lys 


Lys 


Leu 


Arg Ala 


Lys 


Leu 


Gin 


Ser 


Leu 


545 










550 










555 










560 


Lys 


Ala 


Glu 


Arg 


Glu 


Glu 


Ala Arg His Arg Glu 


Glu 


Met 


Ala 


Leu 


Arg 










565 










570 










575 




Gly 


Lys 


Asp 


Ala 


Ala 


Glu 


He 


Val 


Leu 


Glu 


Ala 


Phe 


Cys 


Ala 


His 


Ala 








580 










585 










590 






Ser 


Gin 


Arg 


He 


Ser 


Gin 


Leu 


Glu 


Gin Asp Leu 


Ala 


Ser 


Met 


Arg 


Glu 






595 










600 










605 








Phe 


Arg 


Gly Leu 


Leu 


Lys 


Asp 


Ala 


Gin 


Thr 


Gin 


Leu 


Val 


Gly Leu 


His 




610 










615 










620 










Ala 


Lys 


Gin 


Glu 


Glu 


Leu 


Val 


Gin 


Gin 


Thr 


Val 


Ser 


Leu 


Thr 


Ser 


Thr 


625 










630 










635 










640 


Leu 


Gin 


Gin Asp 


Trp 


Arg 


Ser 


Met 


Gin Leu Asp 


Tyr 


Thr 


Thr 


Trp 


Thr 










645 










650 










655 




Ala 


Leu 


Leu 


Ser 


Arg 


Ser 


Arg 


Gin 


Leu 


Thr 


Glu 


Lys 


Leu 


Thr 


Val 


Lys 








660 










665 










670 






Ser 


Gin 


Gin 


Ala 


Leu 


Gin 


Glu Arg Asp Val Ala 


He 


Glu 


Glu 


Lys 


Gin 






675 










680 










685 








Glu 


Val 


Ser 


Arg 


Val 


Leu 


Glu 


Gin 


Val 


Ser 


Ala 


Gin 


Leu 


Glu 


Glu 


Cys 




690 










695 










700 










Lys 


Gly 


Gin 


Thr 


Glu 


Gin 


Leu 


Glu 


Leu 


Glu 


Asn 


He 


Arg 


Leu 


Ala 


Thr 


705 










710 










715 










720 


Asp 


Leu 


Arg 


Ala 


Gin 


Leu 


Gin 


He 


Leu 


Ala 


Asn 


Met 


Asp 


Ser 


Gin 


Leu 










725 










730 










735 




Lys 


Glu 


Leu 


Gin 


Ser 


Gin 


His 


Thr 


His 


Cys 


Ala 


Gin 


Asp 


Leu 


Ala 


Met 








740 










745 










750 






Lys 


Asp 


Glu 


Leu 


Leu 


Cys 


Gin 


Leu 


Thr 


Gin 


Ser 


Asn 


Glu 


Glu 


Gin 


Ala 






755 










760 










765 








Ala 


Gin 


Cys 


Val 


Lys 


Glu 


Glu 


Met 


Ala 


Leu 


Lys 


His 


Met 


Gin 


Ala 


Glu 




770 










775 










780 










Leu 


Gin 


Gin 


Gin 


Gin 


Ala 


Val 


Leu 


Ala 


Lys 


Glu 


Val 


Arg Asp 


Leu 


Lys 


785 










790 










795 










800 


Glu 


Thr 


Leu 


Glu 


Phe 


Ala 


Asp 


Gin 


Glu 


Asn 


Gin 


Val 


Ala 


His 


Leu 


Glu 










805 










810 










815 




Leu 


Gly 


Gin 


Val 


Glu 


Cys 


Gin 


Leu 


Lys 


Thr 


Thr 


Leu 


Glu 


Val 


Leu 


Arg 








820 










825 










830 






Glu 


Arg 


Ser 


Leu 


Gin 


Cys 


Glu 


Asn 


Leu 


Lys 


Asp 


Thr 


Val 


Glu 


Asn 


Leu 






835 










840 










845 








Thr 


Ala 


Lys 


Leu 


Ala 


Ser 


Thr 


He Ala Asp Asn 


Gin 


Glu 


Gin 


Asp 


Leu 




850 










855 










860 










Glu 


Lys 


Thr Arg 


Gin 


Tyr 


Ser 


Gin 


Lys 


Leu Gly 


Leu 


Leu 


Thr 


Glu 


Gin 


865 










870 










875 










880 


Leu 


Gin 


Ser 


Leu 


Thr 


Leu 


Phe 


Leu 


Gin 


Thr 


Lys 


Leu 


Lys 


Glu 


Lys 


Thr 










885 










890 










895 
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Glu Gin Glu Thr Leu Leu Leu Ser Thr Ala Cys Pro Pro Thr Gin Glu 

900 905 910 

His Pro Leu Pro Asn Asp Arg Thr Phe Leu Gly Ser lie Leu Thr Ala 

915 920 925 

Val Ala Asp Glu Glu Pro Glu Ser Thr Pro Val Pro Leu Leu Gly Ser 

930 . 935 940 

Asp Lys Ser Ala Phe Thr Arg Val Ala Ser Met Val Ser Leu Gin Pro 
945 950 955 960 

Ala Glu Thr Pro Gly Met Glu Glu Ser Leu Ala Glu Met Ser lie Met 

965 970 975 

Thr Thr Glu Leu Gin Ser Leu Cys Ser Leu Leu Gin Glu Ser Lys Glu 

980 985 990 

Glu Ala lie Arg Thr Leu Gin Arg Lys lie Cys Glu Leu Gin Ala Arg 

995 1000 1005 

Leu Gin Ala Gin Glu Glu Gin His Gin Glu Val Gin Lys Ala Lys . Glu 

1010 1015 1020 

Ala Asp lie Glu Lys Leu Asn Gin Ala Leu Cys Leu Arg Tyr Lys Asn 
1025 1030 1035 1040 

Glu Lys Glu Leu Gin Glu Val lie Gin Gin Asn Glu Lys lie Leu Glu 

1045 1050 1055 

Gin lie Asp Lys Ser Gly Glu Leu lie Ser Leu Arg Glu Glu Val Thr 

1060 1065 1070 

His Leu Thr Arg Ser Leu Arg Arg Ala Glu Thr Glu Thr Lys Val Leu 

1075 1080 1085 

Gin Glu Ala Trp Gin Ala Ser Trp Thr Pro Thr Ala Ser Leu Trp Pro 

1090 1095 1100 

Pro lie Gly Ser Arg Arg Lys Cys Gly Ser Leu Arg Arg Trp Thr Asn 
1105 1110 1115 1120 

<210> 96 
<211> 540 
<212> PRT 

<213> Homo sapiens 
<400> 96 

Met Gly Thr Thr Ala Arg Ala Ala Leu Val Leu Thr Tyr Leu Ala Val 

1.5 10 15 

Ala Ser Ala Ala Ser Glu Gly Gly Phe Thr Ala Thr Gly Gin Arg Gin 

20 25 30 

Leu Arg Pro Glu His Phe Gin Glu Val Gly Tyr Ala Ala Pro Pro Ser 

35 40 45 

Pro Pro Leu Ser Arg Ser Leu Pro Met Asp His Pro Asp Ser Ser Gin 

50 55 60 

His Gly Pro Pro Phe Glu Gly Gin Ser Gin Val Gin Pro Pro Pro Ser 
65 70 75 80 

Gin Glu Ala Thr Pro Leu Gin Gin Glu Lys Leu Leu Pro Ala Gin Leu 

85 90 95 

Pro Ala Glu Lys Glu Val Gly Pro Pro Leu Pro Gin Glu Ala Val Pro 

100 105 110 

Leu Gin Lys Glu Leu Pro Ser Leu Gin His Pro Asn Glu Gin Lys Glu 

115 120 125 

Gly Thr Pro Ala Pro Phe Gly Asp Gin Ser His Pro Glu Pro Glu Ser 
130 135 140 
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Trp Asn Ala Ala 
145 

Gly His Arg Leu 

Leu Asn Gin lie 
180 

Trp Asn Leu Pro 
195 

Thr Leu Asn Phe 
210. 

Ser His Thr Asn 
225 

Met Ser Arg Phe 

Trp Cys Cys Thr 
260 

Glu Ala Pro. Gin 
275 

Pro Asp lie Ser 
290 

Thr Leu Asp Asn 
305 

Val Pro Arg Asn 

Ala Leu lie Gin 
340 

Asn Asn His Thr 
355 

Tyr Cys Asp Arg 
370 

Arg His Pro Pro 
385 

Pro Tyr Pro Asn 

Val Thr Pro Asn 
420 

Thr Lys His Lys 
435 

Cys Cys Asp Leu 
450 

Lys Leu Thr Phe 
465 

Arg Asp Pro Ala 

Asn Cys Phe Asn 
500 

Asp Thr Glu Asn 
515 

Thr Asn He Ser 
530 

<210> 97 
<211> 462 



Gin His Cys Gin 
150 

Asp Gly Phe Pro 
165 

Cys Leu Pro Asn 

Gin Ser Ser Tyr 
200 

Leu Glu He Gly 
215 

Arg Leu Glu Cys 
230 

Cys Glu Ala Glu 
245 

Arg Gin Gly Glu 

Pro His Tyr Gin 
280 

Ser Gly Leu Glu 
2 95 

He Lys Asn He 
310 

Leu Pro Ala Thr 
325 

Leu Glu Arg Glu 

Cys Thr Trp Lys 
360 

Glu Tyr Ala Val 
375 

Ser Pro Thr Arg 
390 

Tyr Asp Arg Asp 
405 

Leu Met Gly His 

His He Pro Gly 
440 

Pro Phe Pro Glu 
455 

He Asn Asp Leu 
470 

Leu Cys Cys Tyr 
485 

He Asn Tyr Leu 

Ala Lys Gly Gin 
520 

Ser Thr Ser Glu 
535 



Gin Asp Arg Ser 
155 

Pro Gly Arg Pro 
170 

Arg Gin His Val 
185 

Ser His Leu Thr 

Tyr Ser Arg Cys 
220 

Ala Lys Leu Val 
235 

Phe Ser Val Lys 
250 

Ala Arg Phe Ser 
265 

Leu Arg Ala . Cys 

Leu Pro Phe Pro 
300 

Cys His Leu Arg 
315 

Asp Pro Leu Gin 
330 

Phe Gin Arg Cys 
345 

Ala Trp Glu Asp 

Lys Thr His His 
380 

Asp Glu Cys Phe 
395 

He Leu Thr He 
410 

Leu Cys Gly Asn 
425 

Leu He His Asn 

Gin Ala Cys Cys 
460 

Cys Gly Pro Arg 
475 

Leu Ser Pro Gly 
490 

Arg Asn Val Ala 
505 

Gly Glu Gin Gly 

Pro Lys Glu Glu 
540 



Gin Gly Gly Trp 
160 

Ser Pro Asp Asn 
175 

Val Tyr Gly Pro 
190 

Arg Gin Gly Glu 
205 

Cys His Cys Arg 

Trp Glu Glu Ala 
24 0 

Thr Arg Pro His 
255 

Cys Phe Gin Glu 
270 

Pro Ser His Gin 
285 

Pro Gly Val Pro 

Arg Phe Arg Ser 
320 

Arg Glu Leu Leu 
335 

Cys Arg Gin Gly 
350 

Thr Leu Asp Lys 
365 

His Leu Cys Cys 

Ala Arg Arg Ala 
400 

Asp He Ser Arg 
415 

Gin Arg Val Leu 
430 

Met Thr Ala Arg 
445 

Ala Glu Glu Glu 

Arg Asn He Trp 
480 

Asp Glu Gin Val 
495 

Leu Val Ser Gly 
510 

Ser Thr Gly Gly 
525 
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<212> PRT 

<213> Homo sapiens 



<400> 97 



Met 


Gly 


Lys 


Glu 


Lys 


Thr 


His 


He 


Asn 


He 


Val 


Val 


He Gly His 


Val 


1 








5 










10 








15 




Asp 


Ser 


Gly 


Lys 


Ser 


Thr 


Thr 


Thr Gly His 


Leu 


He 


Tyr Lys 


Cys 


Gly 








20 










25 








30 






Gly 


He 


Asp 
35 


Lys 


Arg 


Thr 


He 


Glu 
40 


Lys 


Phe 


Glu 


Lys 


Glu Ala 
45 


Ala 


Glu 


Met 


Gly 


Lys 


Gly 


Ser 


Phe 


Lys 


Tyr Ala 


Trp 


Val 


Leu 


Asp Lys 


Leu 


Lys 




50 










55 










60 








Ala 


Glu 


Arg 


Glu 


Arg 


Gly 


He 


Thr 


He 


Asp 


He Ser Leu Trp Lys 


Phe 


65 










70 










75 








80 


Glu 


Thr 


Ser 


Lys 


Tyr 


Tyr 


Val 


Thr 


He 


He 


Asp 


Ala 


Pro Gly His 


Arg 










85 










90 








95 




Asp 


Phe 


He 


Lys 


Asn 


Met 


He 


Thr. Gly Thr 


Ser 


Gin 


Ala Asp 


Cys 


Ala 








100 










105 








110 






Val 


Leu 


He 


Val 


Ala 


Ala 


Gly 


Val 


Gly Glu 


Phe 


Glu Ala Gly 


He 


Ser 






115 










120 










125 






Lys 


Asn 
130 


Gly 


Gin 


Thr 


Arg 


Glu 
135 


His 


Ala 


Leu 


Leu 


Ala 
140 


Tyr Thr 


Leu 


Gly 


Val 


Lys 


Gin 


Leu 


He 


Val 


Gly 


Val 


Asn 


Lys 


Met 


Asp 


Ser Thr 


Glu 


Pro 


145 










150 










155 








160 


Pro 


Tyr 


Ser 


Gin 


Lys 
165 


Arg 


Tyr 


Glu 


Glu 


He 
170 


Val 


Lys 


Glu Val 


Ser 
175 


Thr 


Tyr 


He 


Lys 


Lys 
180 


He 


Gly 


Tyr 


Asn 


Pro 
185 


Asp 


Thr 


Val 


Ala Phe 
190 


Val 


Pro 


lie 


Ser 


Gly 
195 


Trp 


Asn 


Gly 


Asp 


Asn 
200 


Met 


Leu 


Glu 


Pro 


Ser Ala 
205 


Asn 


Met 


Pro 


Trp 


Phe 


Lys 


Gly 


Trp 


Lys 


Val 


Thr Arg 


Lys 


Asp 


Gly Asn Ala 


Ser 




210 










215 










220 








Gly 


Thr 


Thr 


Leu 


Leu 


Glu 


Ala 


Val 


Asp 


Cys 


He 


Leu 


Pro Pro 


Thr 


Arg 


225 










230 










235 








240 


Pro 


Thr 


Asp 


Lys 


Pro 


Leu 


Arg 


Leu 


Pro 


Leu 


Gin Asp Val Tyr Lys 


He 










245 










250 








255 




Gly 


Gly 


He 


Gly 


Thr 


Val 


Pro 


Val 


Gly Arg 


Val 


Glu Thr Gly Val 


Leu 








260 










265 








. 270 






Lys 


Pro 


Gly 
275 


Met 


Val 


Val 


Thr 


Phe 
280 


Ala 


Pro 


Val 


Asn 


Val Thr 
285 


Thr 


Glu 


Val 


Lys 
290 


Ser 


Val 


Glu 


Met 


His 
295 


His 


Glu 


Ala 


Leu 


Ser 
300 


Glu Ala 


Leu 


Pro 


Gly 


Asp 


Asn 


Val 


Gly 


Phe 


Asn 


Val 


Lys Asn 


Val 


Ser 


Val Lys 


Asp 


Val 


305 










310 










315 








320 


Arg 


Arg 


Gly 


Asn 


Val 


Ala 


Gly 


Asp 


Ser 


Lys 


Asn Asp 


Pro Pro 


Met 


Glu 










325 










330 








335 




Ala 


Ala 


Gly 


Phe 


Thr 


Ala 


Gin 


Val 


He 


He 


Leu 


Asn 


His Pro Gly 


Gin 








340 










345 








350 






He 


Ser 


Ala 


Gly 


Tyr 


Ala 


Pro 


Val 


Leu Asp 


Cys 


His 


Thr Ala 


His 


He 






355 










360 










365 






Ala 


Cys 
370 


Lys 


Phe 


Ala 


Glu 


Leu 
375 


Lys 


Glu 


Lys 


He 


Asp 
380 


Arg Arg 


Ser 


Gly 


Lys 


Lys 


Leu 


Glu 


Asp 


Gly 


Pro 


Lys 


Phe 


Leu 


Lys 


Ser Gly Asp 


Ala 


Ala 
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385 390 
lie Val Asp Met Val Pro 
405 

Asp Tyr Pro Pro Leu Gly 
420 

Val Ala Val Gly Val lie 
435 

Gly Lys Val Thr Lys Ser 
450 
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395 

Gly Lys Pro Met Cys Val 
410 

Arg Phe Ala Val Arg Asp 
425 

Lys Ala Val Asp Lys Lys 
440 

Ala Gin Lys Ala Gin Lys 
455 460 
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400 

Glu Ser Phe Ser 
415 

Met Arg Gin Thr 
430 

Ala Ala Gly Ala 
445 

Ala Lys 



<400> 98 

Lys Ser Lys Arg Gin Ala Gin Gin Met Val Gin Pro Gin Ser Pro Val 

■ 1 5 10 15 

Ala Val Ser Gin Ser Lys Pro Gly Cys Tyr Asp Asn Gly Lys His Tyr 

20 25 30 

Gin lie Asn Gin Gin Trp Glu Arg Thr Tyr Leu Gly Asn Val Leu Val 

35 40 45 

Cys Thr Cys Tyr Gly Gly Ser Arg Gly Phe Asn Cys Glu Ser Lys Pro 

50 55 60 

Glu Ala Glu Glu Thr Cys Phe Asp Lys Tyr Thr Gly Asn Thr Tyr Arg 
65 70 75 ~ 80 

Val Gly Asp Thr Tyr Glu Arg Pro Lys Asp Ser Met lie Trp Asp Cys 

85 90 95 

Thr Cys lie Gly Ala Gly Arg Gly Arg lie Ser Cys Thr He Ala Asn 

100 105 110 

Arg Cys His Glu Gly Gly Gin Ser Tyr Lys He Gly Asp Thr Trp Arg 

115 120 J 125 

Arg Pro His Glu Thr Gly Gly Tyr Met Leu Glu Cys Val Cys Leu Gly 

130 13 5 14 0 

Asn Gly Lys Gly Glu Trp Thr Cys Lys Pro He Ala Glu Lys Cys Phe 
145 150 155 * 160 

Asp His Ala Ala Gly Thr Ser Tyr Val Val Gly Glu Thr Trp Glu Lys 

165 170 175 

Pro Tyr Gin Gly Trp Met Met Val Asp Cys Thr Cys Leu Gly Glu Gly 

180 185 190 

Ser Gly Arg lie Thr Cys Thr Ser Arg Asn Arg Cys Asn Asp Gin Asp 

155 200 ^ 205 

Thr Arg Thr Ser Tyr Arg He Gly Asp Thr Trp Ser Lys Lys Asp Asn 

210 215 220 

Arg Gly Asn Leu Leu Gin Cys He Cys Thr Gly Asn Gly Arg Gly Glu 
225 230 235 240 

Trp Lys Cys Glu Arg His Thr Ser Val Gin Thr Thr Ser Ser Gly Ser 

245 250 255 

Gly Pro Phe Thr Asp Val Arg Ala Ala Val Tyr Gin Pro Gin Pro His 

260 265 270 

Pro Gin Pro Pro Pro Tyr Gly His Cys Val Thr Asp Ser Gly Val Val 

275 280 t 285 

Tyr Ser Val Gly Met Gin Trp Leu Lys Thr Gin Gly Asn Lys Gin Met 
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290 






Leu 


Cys 


Thr 


Cys 


305 








Thr 


Gin 


Thr 


Tyr 


Phe 


Thr 


Tyr 


Asn 








340 




Asp 


Gly His 






355 




Gin 


Lys 


Tyr 


Ser 




370 






Gly 


Gly Asn 


Ser 


385 








Asn 


His 


Asn 


Tyr 


Lys 


Trp 


Cys 


Gly 








420 


rne 


Cys 


Pro 


Met 






435 




Val 


Met 


Tyr Arg 




450 






His 


Met 


Met Arg 


465 








Cys 


He 


Ala 


Tyr 


Thr 


Tyr Asn Val 








500~ 


Met 


Leu 


Asn 


Cys 






515 




Asp 


Pro 


Val 


Asp 




530 






He 


Gly Asp 


Ser 


545 








Tyr 


Cys 


Tyr Gly 


Thr 


Tyr 


Pro 


Ser 








580 


Pro 


Ser 


Gin 


Pro 






595 




Ser 


His 


He 


Ser 




610 






Gly 


Arg 


Trp 


Lys 


625 








He 


Lys 


Gly Leu 


He 


Gin 


Gin 


Tyr 








660 


Thr 


Ser 


Thr 


Ser 






675 




Thr 


Pro 


Phe 


Ser 




690 






Thr 


Ala 


Ser 


Ser 



705 



295 

Leu Gly Asn Gly 
310 

Gly Gly Asn Leu 
325 

Gly Arg Thr Phe 

Leu Trp Cys Ser 
360 

Phe Cys Thr Asp 
375 

Asn Gly Ala Leu 
390 

Thr Asp Cys Thr 
405 

Thr Thr Gin Asn 

Ala Ala His Glu 
440 

He Gly Asp Gin 
455 

Cys Thr Cys Val 
470 

Ser Gin Leu Arg 
485 

Asn Asp Thr Phe 

Thr Cys Phe Gly 
520 

Gin Cys Gin Asp 
535 

Trp Glu Lys Tyr 
550 

Arg Gly He Gly 
565 

Ser Ser Gly Pro 

Asn Ser His Pro 
600 

Lys Tyr He Leu 
615 

Glu Ala Thr He 
630 

Lys Pro Gly Val 
645 

Gly His Gin Glu 

Thr Pro Val Thr 
680 

Pro Leu Val Ala 
695 

Phe Val Val Ser 
710 



300 



Val 


Ser 


Cys 


Gin 






315 




Asn Gly 


Glu 


Pro 




330 






Tyr 


Ser 


Cys 


Thr 


345 








rpl_. ~» 

±nr 


Thr 


Ser 


Asn 


His 


Thr 


Val 


Leu 








380 


Cys 


His 


Phe 


Pro 






3 95 




Ser 


Glu 


Gly Arg 




410 






Tyr 


Asp 


Ala 


Asp 


425 








Glu 


He 


Cys 


Thr 


Trp 


Asp 


Lys 


Gin 








460 


Gly Asn 


Gly Arg 






475 




Asp 


Gin 


Cys 


He 




490 






His 


Lys 


Arg 


His 


505 








Gin Gly 


Arg 


Gly 


Ser 


Glu 


Thr 


Gly 








540 


Val 


His 


Gly Val 






555 




Glu 


Trp 


His 


Cys 




570 






Val 


Glu 


Val 


Phe 


585 








He 


Gin 


Trp 


Asn 


Arg 


Trp 


Arg 


Pro 








620 


Pro Gly 


His 


Leu 






635 




Val 


Tyr 


Glu 


Gly 




650 






Val 


Thr 


Arg 


Phe 


665 








Ser 


Asn 


Thr 


Val 


Thr 


Ser 


Glu 


Ser 








700 


Trp 


Val 


Ser 


Ala 






715 





Glu 


Thr 


Ala 


Val 








J Z U 


Cys 


Val 


Leu 


Pro 






335 




Thr 


Glu 


Gly 


Arg 




350 






Tyr 


Glu 


Gin 


Asp 


365 








Val 


Gin 


Thr 


Gin 


Phe 


Leu 


Tyr 


Asn 








a r\ f\ 


Arg 


Asp 


Asn 


Met 






415 




Gin 


Lys 


Phe 


Gly 




430 






Thr 


Asn 


Glu 


Gly 


445 








His 


Asp 


Met 


Gly 


Gly Glu Trp 


Thr 








a a n 


Val 


Asp 


Asp 


He 






495 




Glu 


Glu 


Gly 


His 




510 






Arg 


Trp 


Lys 


Cys 


525 








Thr 


Phe 


Tyr 


Gin 


Arg 


Tyr 


Gin 


Cys 








c c c\ 


Gin 


Pro 


Leu 


Gin 






575 




He 


Thr 


Glu 


Thr 




590 






Ala 


Pro 


Gin 


Pro 


605 








Lys 


Asn 


Ser 


Val 


Asn 


Ser 


Tyr 


Thr 








D ft U 


Gin 


Leu 


He 


Ser 






655 




Asp 


Phe 


Thr 


Thr 




670 






Thr Gly Glu 


Thr 


•685 








Val 


Thr 


Glu 


He 


Ser 


Asp 


Thr 


Val 



720 
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Ser Gly Phe Arg Val Glu Tyr Glu Leu Ser Glu Glu Gly Asp Glu Pro 

725 730 735 

Gin Tyr Leu Asp Leu Pro Ser Thr Ala Thr Ser Val Asn lie Pro Asp 

740 745 750 

Leu Leu Pro Gly Arg Lys Tyr He Val Asn Val Tyr Gin He Ser Glu 

755 760 765 

Asp Gly Glu Gin Ser Leu He Leu Ser Thr Ser Gin Thr Thr Ala Pro 

770 775 780 

Asp Ala Pro Pro Asp Pro Thr Val Asp Gin Val Asp Asp Thr Ser lie 
785 790 795 800 

Val Val Arg Trp Ser Arg Pro Gin Ala Pro He Thr Gly Tyr Arg He 

805 810 815 

Val Tyr Ser Pro Ser Val Glu Gly Ser Ser Thr Glu Leu Asn Leu Pro 

820 825 830 

Glu Thr Ala Asn Ser Val Thr Leu Ser Asp Leu Gin Pro Gly Val Gin 

835 840 845 

Tyr Asn He Thr He Tyr Ala Val Glu Glu Asn Gin Glu Ser Thr Pro 

850 855 860 

Val Val He Gin Gin Glu Thr Thr Gly Thr Pro Arg Ser Asp Thr Val 
865 870 875 880 

Pro Ser Pro Arg Asp- Leu Gin Phe Val Glu Val Thr Asp Val Lys Val 

885 890 895 

Thr He Met Trp Thr Pro Pro Glu Ser Ala Val Thr Gly Tyr Arg' Val 

900 905 910 

Asp Val He Pro Val Asn Leu Pro Gly Glu His Gly Gin Arg Leu Pro 

915 920 925 

He Ser Arg Asn Thr Phe Ala Glu Val Thr Gly Leu Ser Pro Gly Val 

930 935 940 

Thr Tyr Tyr Phe Lys Val Phe Ala Val Ser His Gly Arg Glu Ser Lys 
945 950 955 ** " 960 

Pro Leu Thr Ala Gin Gin Thr Thr Lys Leu Asp Ala Pro Thr Asn Leu 

965 970 975 

Gin Phe Val Asn Glu Thr Asp Ser Thr Val Leu Val Arg Trp Thr Pro 

980 985 990 

Pro Arg Ala Gin He Thr Gly Tyr Arg Leu Thr Val Gly Leu Thr Arg 

995 1000 1005 

Arg Gly Gin Pro Arg Gin Tyr Asn Val Gly Pro Ser Val Ser Lys Tyr 

1010 1015 1020 

Pro Leu Arg Asn Leu Gin Pro Ala Ser Glu Tyr Thr Val Ser Leu Val 
1025 1030 1035 1040 

Ala He Lys Gly Asn Gin Glu Ser Pro Lys Ala Thr Gly Val Phe Thr 

1045 1050 1055 

Thr Leu Gin Pro Gly Ser Ser He Pro Pro Tyr Asn Thr Glu Val Thr 

1060 1065 1070 

Glu Thr Thr He Val He Thr Trp Thr Pro Ala Pro Arg He Gly Phe 

1075 1080 1085 

Lys Leu Gly Val Arg Pro Ser Gin Gly Gly Glu Ala Pro Arg Glu Val 

1090 1095 1100 

Thr Ser Asp Ser Gly Ser He Val Val Ser Gly Leu Thr Pro Gly Val 
1105 1110 ins 1120 

Glu Tyr Val Tyr Thr He Gin Val Leu Arg Asp Gly Gin Glu Arg . Asp 

1125 1130 1135 

Ala Pro He Val Asn Lys Val Val Thr Pro Leu Ser Pro Pro Thr Asn 
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1140 1145 1150 

Leu His Leu Glu Ala Asn Pro Asp Thr Gly Val Leu Thr Val Ser Trp 

1155 1160 1165 

Glu Arg Ser Thr Thr Pro Asp lie Thr Gly Tyr Arg He Thr Thr Thr 

1170 1175 1180 

Pro Thr Asn Gly Gin Gin Gly Asn Ser Leu Glu Glu Val Val His Ala 
1185 1190 1195 1200 

Asp Gin Ser Ser Cys Thr Phe Asp Asn Leu Ser Pro Gly Leu Glu Tyr 

1205 1210 1215 

Asn Val Ser Val Tyr Thr Val Lys Asp Asp Lys Glu Ser Val Pro He 

1220 1225 1230 

Ser Asp Thr He He Pro Ala Val Pro Pro Pro Thr Asp Leu Arg Phe 

1235 1240 1245 

Thr Asn He Gly Pro Asp Thr Met Arg Val Thr Trp Ala Pro Pro Pro 

1250 1255 1260 

Ser He Asp Leu Thr Asn Phe Leu Val Arg Tyr Ser Pro Val Lys Asn 
1265 1270 1275 1280 

Glu Glu Asp Val Ala Glu Leu Ser He Ser Pro Ser Asp Asn Ala Val 

1285 1290 1295 

Val Leu Thr Asn Leu Leu Pro Gly Thr Glu Tyr Val Val Ser Val Ser 

1300 1305 1310 

Ser Val Tyr Glu Gin His Glu Ser Thr Pro Leu Arg Gly Arg Gin Lys 

1315 1320 1325 

Thr Gly Leu Asp Ser Pro Thr Gly He Asp Phe. Ser Asp He Thr Ala 

13.30 1335 1340 

Asn Ser Phe Thr Val His Trp He Ala Pro Arg Ala Thr lie Thr Gly 
1345 1350 1355 1360 

Tyr Arg He Arg His His Pro Glu His Phe Ser Gly Arg Pro Arg Glu 

1365 1370 1375 

Asp Arg Val Pro His Ser Arg Asn Ser He Thr Leu Thr Asn Leu Thr 

1380 1385 1390 

Pro Gly Thr Glu Tyr Val Val Ser He Val Ala Leu Asn Gly Arg Glu 

1395 1400 1405 

Glu Ser Pro Leu Leu He Gly Gin Gin Ser Thr Val Ser Asp Val Pro 

1410 1415 1420 

Arg Asp Leu Glu Val Val Ala Ala Thr Pro Thr Ser Leu Leu He Ser 
1425 1430 1435 1440 

Trp Asp Ala Pro Ala Val Thr Val Arg Tyr Tyr Arg He Thr Tyr Gly 

1445 1450 1455 

Glu Thr Gly Gly Asn Ser Pro Val Gin Glu Phe Thr Val Pro Gly Ser 

1460 1465 1470 

Lys Ser Thr Ala Thr He Ser Gly Leu Lys Pro Gly Val Asp Tyr Thr 

1475 1480 1485 

He Thr Val Tyr Ala Val Thr Gly Arg Gly Asp Ser Pro Ala Ser Ser 

1490 1495 1500 

Lys Pro He Ser He Asn Tyr Arg Thr Glu He Asp Lys Pro Ser Gin 
1505 1510 1515 1520 

Met Gin Val Thr Asp Val Gin Asp Asn Ser He Ser Val Lys Trp Leu 

1525 1530 1535 

Pro Ser Ser Ser Pro Val Thr Gly Tyr Arg Val Thr Thr Thr Pro Lys 

1540 1545 1550 

Asn Gly Pro Gly Pro Thr Lys Thr Lys Thr Ala Gly Pro Asp Gin Thr 
1555 1560 1565 
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Glu Met Thr lie Glu Gly Leu Gin Pro Thr Val Glu Tyr Val Val Ser 

1570 1575 1580 

Val Tyr Ala Gin Asn Pro Ser Gly Glu Ser Gin Pro Leu Val Gin Thr 
!585 1590 1595 160i 

Ala Val Thr Asn He Asp Arg Pro Lys Gly Leu Ala Phe Thr Asp Val 

1605 1610 1615 

Asp Val Asp Ser He Lys He Ala Trp Glu Ser Pro Gin Gly Gin Val 

1620 1625 1630 

Ser Arg Tyr Arg Val Thr Tyr Ser Ser Pro Glu Asp Gly He His Glu 

1635 1640 1645 

Leu Phe Pro Ala Pro Asp Gly Glu Glu Asp Thr Ala Glu Leu Gin Gly 

1650 1655 1660 

Leu Arg Pro Gly Ser Glu Tyr Thr Val Ser Val Val Ala Leu His Asp 
1665 1670 1675 168C 

Asp Met Glu Ser Gin Pro Leu He Gly Thr Gin Ser Thr Ala He Pro 

1685 1690 1695 

Ala Pro Thr Asp Leu Lys Phe Thr Gin Val Thr Pro Thr Ser Leu Ser 

1700 1705 1710 

Ala Gin Trp Thr Pro Pro Asn Val Gin Leu Thr Gly Tyr Arg Val Arg 

1715 1720 1725 

Val Thr Pro Lys Glu Lys Thr Gly Pro Met Lys Glu He Asn Leu Ala 

1730 1735 1740 

Pro Asp Ser Ser Ser Val Val Val Ser Gly Leu Met Val Ala Thr Lys 
1745 1750 1755 176C 

Tyr Glu Val Ser Val Tyr Ala Leu Lys Asp Thr Leu Thr Ser Arg Pro 

1765 1770 1775 

Ala Gin Gly Val Val Thr Thr Leu Glu Asn Val Ser Pro Pro Arg Arg 

1780 1785 1790 

Ala Arg Val Thr Asp Ala Thr Glu Thr Thr He Thr He Ser Trp Arg 

1795 1800 1805 

Thr Lys Thr Glu Thr He Thr Gly Phe Gin Val Asp Ala Val Pro Ala 

1810 1815 1820 

Asn Gly Gin Thr Pro He Gin Arg Thr He Lys Pro Asp Val Arg Ser 
1825 1830 1835 1840 

Tyr Thr He Thr Gly Leu Gin Pro Gly Thr Asp Tyr Lys He Tyr Leu 

1845 1850 1855 

Tyr Thr Leu Asn Asp Asn Ala Arg Ser Ser Pro Val Val lie Asp Ala 

I860 1865 1870 

Ser Thr Ala He Asp Ala Pro Ser Asn Leu Arg Phe Leu Ala Thr Thr 

1875 1880 ~ 1885 

Pro Asn Ser Leu Leu Val Ser Trp Gin Pro Pro Arg Ala Arg He Thr 

1890 1895 1900 

Gly Tyr He He Lys Tyr Glu Lys Pro Gly Ser Pro Pro Arg Glu Val 
1905 1910 1915 1920 

Val Pro Arg Pro Arg Pro Gly Val Thr Glu Ala Thr He Thr Gly Leu 

1925 1930 1935 

Glu Pro Gly Thr Glu Tyr Thr He Tyr Val He Ala Leu Lys Asn Asn 

1940 1945 1950 

Gin Lys Ser Glu Pro Leu He Gly Arg Lys Lys Thr Asp Glu Leu Pro 

1955 1960 1965 

Gin Leu Val Thr Leu Pro His Pro Asn Leu His Gly Pro Glu He Leu 

1970 1975 1980 

Asp Val Pro Ser Thr Val Gin Lys Thr Pro Phe Val Thr His Pro Gly 
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1985 



1990 



1995 



2000 



Tyr Asp Thr Gly Asn Gly lie Gin Leu Pro Gly Thr Ser Gly Gin Gin 

2005 2010 2015 

Pro Ser Val Gly Gin Gin Met lie Phe Glu Glu His Gly Phe Arg Arg 

2020 2025 2030 

Thr Thr Pro Pro Thr Thr Ala Thr Pro lie Arg His Arg Pro Arg Pro 

2035 2040 2045 

Tyr Pro Pro Asn Val Gly Gin Glu Ala Leu Ser Gin Thr Thr lie Ser 

2050 2055 2060 

Trp Ala Pro Phe Gin Asp Thr Ser Glu Tyr lie lie Ser Cys His Pro 
2065 2070 2075 2080 

Val Gly Thr Asp Glu Glu Pro Leu Gin Phe Arg Val Pro Gly Thr Ser 

2085 2090 2095 

Thr Ser Ala Thr Leu Thr Gly Leu Thr Arg Gly Ala Thr Tyr Asn lie 

2100 2105 2110 

lie Val Glu Ala Leu Lys Asp Gin Gin Arg His Lys Val Arg Glu Glu 

2115 2120 2125 

Val Val Thr Val Gly Asn Ser Val Asn Glu Gly Leu Asn Gin Pro Thr 

. 2130 2135 2140 

Asp Asp Ser Cys Phe Asp Pro Tyr Thr Val Ser His Tyr Ala Val Gly 
2145 2150 2155 2160 

Asp Glu Trp Glu Arg Met Ser Glu Ser Gly Phe Lys Leu Leu Cys Gin 

2165 . 2170 2175 

Cys Leu Gly Phe Gly Ser Gly His Phe Arg Cys Asp Ser Ser Arg Trp 

2180 2185 2190 

Cys His Asp Asn Gly Val Asn Tyr Lys He Gly Glu Lys Trp Asp Arg 

2195 2200 2205 

Gin Gly Glu Asn Gly Gin, Met Met Ser Cys Thr Cys Leu Gly Asn Gly 

2210 2215 2220 

Lys Gly Glu Phe Lys Cys Asp Pro His Glu Ala Thr Cys Tyr Asp Asp 
2225 2230 2235 2240 

Gly Lys Thr Tyr His Val Gly Glu Gin Trp Gin Lys Glu Tyr Leu Gly 

2245 2250 2255 

Ala He Cys Ser Cys Thr Cys Phe Gly Gly Gin Arg Gly Trp Arg Cys 

2260 2265 2270 

Asp Asn Cys Arg Arg Pro Gly Gly Glu Pro Ser Pro Glu Gly Thr Thr 

2275 2280 2285 

Gly Gin Ser Tyr Asn Gin Tyr Ser Gin Arg Tyr His Gin Arg Thr Asn 

2290 2295 2300 

Thr Asn Val Asn Cys Pro lie Glu Cys Phe Met Pro Leu Asp Val Gin 
2305 2310 2315 2320 

Ala Asp Arg Glu- Asp Ser Arg Glu 



<210> 99 

<211> 188 

<212> PRT 

<213> Homo sapiens 

<400> 99 

His Gin Thr His Lys Glu Gly Gly Ser Thr His Ala Ser Ala Asp Ala 

15 10 15 

Trp Glu He He Glu Leu Glu Thr Glu He Glu Lys Phe Lys Ala Glu 



2325 
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20 










25 










30 






Asn 


Ala 


Ser 
35 


Leu 


Ala 


Lys 


Leu 


Arg 
40 


He 


Glu 


Arg 


Glu 


Ser 
45 


Ala 


Leu 


Glu 


Lys 


Leu 


Arg 


Lys 


Glu 


He 


Ala 


Asp Phe Glu Gin Gin Lys Ala Lys Glu 




50 










55 










60 










Leu 


Ala 


Arg 


He 


Glu 


Glu 


Phe 


Lys 


Lys 


Glu 


Glu 


Met 


Arg 


Lys 


Leu 


Gin 


65 










70 










75 










80 


Lys 


Glu 


Arg 


Lys 


Val 
85 


Phe 


Glu 


Lys 


Tyr 


Thr 
90 


Thr 


Ala 


Ala 


Arg 


Thr 
95 


Phe 


Pro 


Asp 


Lys 


Lys 


Glu Arg Glu 


Glu 


He 


Gin 


Thr 


Leu 


Lys 


Gin 


Gin 


He 








100 










105 










110 






Ala 


Asp 


Leu 
115 


Arg 


Glu 


Asp 


Leu 


Lys 
120 


Arg 


Lys 


Glu 


Thr 


Lys 
125 


Trp 


Ser 


Ser 


Thr 


His 
13 0 


Ser 


Arg 


Leu 


Arg 


Ser 
135 


Gin 


He 


Gin 


Met 


Leu 
140 


Val 


Arg 


Glu 


Asn 


Thr 


Asp 


Leu 


Arg 


Glu 


Glu 


He 


Lys 


Val 


Met 


Glu 


Arg 


Phe 


Arg 


Leu 


Asp 


145 










150 










155 










160 


Ala 


Trp 


Lys 


Arg 


Ala 
165 


Glu 


Ala 


He 


Glu 


Ser 
170 


Ser 


Leu 


Glu 


Val 


Glu 
175 


Lys 


Lys 


Asp 


Lys 


Leu 
180 


Ala 


Asn 


Thr 


Ser 


Val 
185 


Arg 


Phe 


Gin 











<210> 100 
<211> 284 
<212> PRT 

<213> Homo sapiens 
<400> 100 



Met 


Glu 


Pro 


Gly Asn Tyr. Ala 


Thr 


Leu 


Asp Gly Ala 


Lys Asp 


He Glu 


1 






5 






10 




15 


Gly 


Leu 


Leu 


Gly Ala Gly Gly Gly Arg Asn Leu Val Ala His 


Ser Pro 








20 




25 




30 




Leu 


Thr 


Ser 


His Pro Ala Ala 


Pro 


Thr 


Leu Met Pro 


Ala Val 


Asn Tyr 






35 




40 






45 


Ala 


Pro 


Leu 


Asp Leu Pro Gly 


Ser 


Ala 


Glu Pro Pro 


Lys Gin 


Cys His 




50 




55 






60 






Pro 


Cys 


Pro 


Gly Val Pro Gin Gly Thr 


Ser Pro Ala 


Pro Val 


Pro Tyr 


65 






70 






75 




80 


Gly 


Tyr 


Phe 


Gly Gly Gly Tyr 


Tyr 


Ser 


Cys Arg Val 


Ser Arg 


Ser Ser 








85 






90 




95 


Leu 


Lys 


Pro 


Cys Ala Gin Ala 


Ala 


Thr 


Leu Ala Ala 


Tyr Pro 


Ala Glu 








100 




105 




110 




Thr 


Pro 


Thr 


Ala Gly Glu Glu 


Tyr 


Pro 


Ser Arg Pro 


Thr Glu 


Phe Ala 






115 




120 






125 




Phe 


Tyr 


Pro 


Gly Tyr Pro Gly Thr Tyr His Ala Met Ala Ser Tyr Leu 




130 




135 






140 






Asp 


Val 


Ser 


Val Val Gin Thr 


Leu 


Gly Ala Pro Gly Glu Pro Arg His 


145 






150 






155 




160 


Asp 


Ser 


Leu 


Leu Pro Val Asp 


Ser 


Tyr 


Gin Ser Trp Ala Leu Ala Gly 








165 






170 




175 


Gly 


Trp 


Asn 


Ser Gin Met Cys 


Cys 


Gin 


Gly Glu Gin Asn Pro 


Pro Gly 








180 




185 




190 




Pro 


Phe 


Trp 


Lys Ala Ala Phe 


Ala 


Asp 


Ser Ser Gly Gin His 


Pro Pro 
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195 










200 








205 








Asp 


Ala 


C Y S 


Ala 


Phe 


Arg 


Arg Gly 


Arg 


Lys 


Lys 


Arg He 


Pro 


Tyr 


Ser 




210 










2 15 










220 








Lys 


Gly 


Gin 


Leu 


Arg 


Glu 


Leu 


Glu 


Arg 


Glu 


Tyr 


Ala Ala 


Asn 


Lys 


Phe 


one 




















235 










lie 


Thr 


Lys 


Asp 


Lys 


Arg 


Arg 


Lys 


He 


Ser 


Ala 


Ala Thr 


Ser 


Leu 


Ser 










245 










250 








255 




Glu 


Arg 


Gin 


lie 


Thr 


lie 




Phe 


Gin 


Asn 


Arg Arg Val 


Lys 


Glu 


Lys 








260 










265 








270 






Lys 


Val 


Leu 


Ala 


Lys 


V CLJL 


Lys 


Asn 


OCX 


nl a. 


Thr 


Pro 












275 










280 
















<210> 101 


























<211> 676 


























<212> PRT 


























<213> Homo sapiens 






















<400> 101 


























Met 


Asp 


Lys 


Tyr 


Asp 


Asp 


Leu 


Gly 


Leu 


Glu 


Ala 


Ser Lys 


Phe' 


He 


Glu 


1 








5 










10 








15 




Asp 


Leu 


Asn 


Met 


Tyr 


Glu 


Ala 


Ser 


Lys 


Asp 


Gly Leu Phe 


Arg 


Val 


Asp 








20 










25 








30 






Lys 


Gly 


Ala 


Gly 


Asn 


Asn 


Pro 


Glu 


Phe 


Glu 


Glu 


Thr Arg 


Arg 


Val 


Phe 






35 










40 








45 








Ala 


Thr 


Lys 


Met 


Ala 


Lys 


He 


His 


Leu 


Gin 


Gin 


Gin Gin 


Gin 


Gin 


Leu 




50 










55 










60 








Leu 


Gin 


Glu 


Glu 


Thr 


Leu 


Pro 


Arg 


Gly Ser 


Arg 


Gly Pro 


Val 


Asn 


Gly 


65 










70 










75 








80 


Gly 


Gly 


Arg 


Leu 


Gly Pro 


Gin 


Ala 


Arg 


Trp 


Glu 


Val Val 


Gly 


Ser 


Lys 










85 










90 








95 




Leu 


Thr 


Val 


Asp 


Gly Ala 


Ala 


Lys 


Pro 


Pro 


Leu 


Ala Ala 


Ser 


Thr 


Gly 








100 










105 








110 






Ala 


Pro 


Gly 


Ala 


Val 


Thr 


Thr 


Leu 


Ala 


Ala 


Gly Gin Pro 


Pro 


Tyr 


Pro 






115 










120 








125 








Pro 


Gin 


Glu 


Gin 


Arg 


Ser 


Arg 


Pro 


Tyr 


Leu 


His 


Gly Thr 


Arg 


His 


Gly 




13 0 










135 










140 








Ser 


Gin 


Asp 


Cys 


Gly Ser 


Arg 


Glu 


Ser 


Leu 


Ala 


Thr Ser 


Glu 


Met 


Ser 


145 










150 










155 








160 


Ala 


Phe 


His 


Gin 


Pro 


Gly 


Pro 


Cys 


Glu 


Asp 


Pro 


Ser Cys 


Leu 


Thr 


His 










165 










170 








175 




Gly 


Asp 


Tyr 


Tyr 


Asp Asn 


Leu 


Ser 


Leu 


Ala 


Ser 


Pro Lys 


Trp 


Gly 


Asp 








180 










185 








190 






Lys 


Pro 


Gly 


Val 


Ser 


Pro 


Ser 


He 


Gly Leu 


Ser 


Val Gly 


Ser 


Gly 


Trp 






195 










200 








205 








Pro 


Ser 


Ser 


Pro 


Gly Ser 


Asp 


Pro 


Pro 


Leu 


Pro 


Lys Pro 


Cys 


Gly 


Asp 




210 










215 










220 








His 


Pro 


Leu 


Asn 


His 


Arg 


Gin 


Leu 


Ser 


Leu 


Ser 


Ser Ser 


Arg 


Ser 


Ser 


225 










230 










235 








240 


Glu 


Gly 


Ser 


Leu 


Gly Gly 


Gin 


Asn 


Ser Gly 


He 


Gly Gly 


Arg 


Ser 


Ser 










245 










250 








255 




Glu 


Lys 


Pro 


Thr 


Gly Leu 


Trp 


Ser 


Thr 


Ala 


Ser 


Ser Gin 


Arg 


Val 


Ser 








260 










265 








270 






Pro 


Gly 


Leu 


Pro 


Ser 


Pro 


Asn 


Leu 


Glu 


Asn 


Gly Ala Pro 


Ala 


Val 


Gly 
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275 

Pro Val Gin Pro 
290 

Cys Pro Arg Gin 
305 

Glu Val Ser Gly 

Phe Gin Asp Gly 
340 

Ser Pro Ala Gly 
355 

Gly Pro Lys Pro 
370 

Pro Thr Ser Leu 
385 

Cys Lys Glu Gly 

Val Leu Leu Asp 
420 

Pro Leu Val Pro 
435 

Leu Glu Ala Leu 
450 

Lys Ala Asp Tyr 
465 

Gly Ala Gly Gin 

Cys Phe Thr Cys 
500 

Tyr Phe Val Asn 
515 

Gly Phe Gin Gin 
530 

Met Asp Met lie 
545 

Phe Arg Cys Val 

Val Asp Ser Glu 
580 

Leu Ala Pro Lys 
595 

Gly Ser Asp Glu 
610 

Val Glu Cys Tyr 
625 

Asp Gly His Arg 

Cys His Val Lys 
660 

Gin His His Phe 
675 



280 

Arg Thr Pro Ser 
295 

Gly Gly Leu Pro 
310 

Val Met Ser Lys 
325 

Pro Lys Ser Tyr 

Leu Asp Gly Ser 
360 

Gly Cys Thr Asp 
375 

Val His Pro Val 
390 

Pro Leu Gly Trp 
405 

Ser Pro Ser Ser 

Gly Pro Glu Leu 
440 

Thr Gin Arg Leu 
455 

Phe Gly Ala Cys 
470 

Ala Cys Gin Ala 
485 

Ala Ala Cys Ser 

Gly Lys Val Phe 
520 

Ser Ala Asp Arg 
535 

Leu Gin Ala Leu 
550 

lie Cys Asn Glu 
5 65 

Asn Lys lie Tyr 

Cys Ala Ala Cys 
600 

Thr lie Arg Val 
615 

His Cys Glu Asp 
630 

Cys Tyr Pro Leu 
645 

Arg Leu Glu Lys 



Val Ser Ala Pro 
300 

Arg Ser Asn Ser 
315 

Pro Asn Val Asp 
330 

Leu Ser Ser Ser 
345 

Gin Gin Gly Ala 

Leu Gly Thr Gly 
380 

Met Ser Thr Leu 
395 

Ser Ser Asp Gly 
410 

Pro Arg Val Arg 
425 

Arg Pro Ser Ala 

Glu Arg Glu Met 
460 

Val Lys Cys Ser 
475 

Met Gly Asn Leu 
490 

Arg Lys Leu Arg 
505 

Cys Glu Glu Asp 

Cys Phe Leu Cys 
540 

Gly Lys Ser Tyr 
555 

Cys Leu Asp Gly 
570 

Cys Val Arg Asp 
585 

Gly Leu Pro lie 

Val Ser Met Asp 
620 

Cys Gly Leu Glu 
635 

Glu Asp His Leu 
650 

Arg Pro Ser Ser 
665 



285 

Leu Ala Leu Ser 

Gly Leu Gly Gly 
320 

Pro Gin Pro Trp 
335 

Ala Pro Ser Ser 
350 

Val Pro Gly r Leu 
3 65 

Pro Lys Leu Ser 

Pro Glu Leu Ser 
400 

Ser Leu Gly Ser 
415 

Leu Pro Cys Gin 
430 

Ala Glu Leu Lys 
445 

Asp Ala His Pro 

Lys Gly Val Phe 
480 

Tyr His Asp Thr 
495 

Gly Lys Ala Phe 
510 

Phe Leu Tyr Ser 
525 

Gly His Leu lie 

His Pro Gly Cys 
560 

Val Pro Phe Thr 
575 

Tyr His Lys Val 
590 

Leu Pro Pro Glu 
605 

Arg Asp Tyr His 

Leu Asn Asp Glu 
640 

Phe Cys His Ser 
655 

Thr Ala Leu His 
670 



<210> 102 
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<211> 296 
<212> PRT 

<213> Homo sapiens 



<400> 102 



Ser 


Thr 


Gly 


Ser 


1 








Ala 


Ala 


Arg 


Ala 








20 


Arg 


Glu 


Thr 


Arg 






35 




Thr 


Met 


Thr Gly 




50 






Gin 


Arg 


Cys 


Gin 


65 








Asn 


Gly 


Glu 


Leu 


Arg 


Pro 


Phe 


Pro 








100 


Cys 


Glu 


His 


Asp 






115 




Gly 


Glu 


Phe 


lie 




130 






His 


Pro 


Gly Cys 


145 








Leu 


Gly 


Phe 


Val 


Asn 


Arg 


Glu 


Lys 








180 


His 


Leu 


Val 


He 






195 




His 


Pro 


As P 


His 




210 






Glu 


Ala 


Arg 


Glu 


225 








Lys 


Met 


Gly Val 


Arg 


Val 


Val 


Asn 








260 


Cys 


Ala 


Lys 


Cys 






275 




Lys 


Gly 


Leu 


Ala 




290 







Glu 


Phe 


Pro 


Leu 


5 








Gly 


Gly Arg Ala 


Gly 


Asn 


Pro 


Gly 








40 


Ser 


Asn 


Met 


Ser 






55 




Ala 


Arg 


Phe 


Ser 




70 






Tyr 


His 


Glu 


His 


85 








Glu 


Gly 


Leu 


Phe 


Phe 


Gin 


Met 


Leu 








120 


He 


Gly Arg Val 






135 




Phe 


Arg 


Cys 


Glu 




150 






Lys 


Asn 


Ala 


Gly 


165 








Ala 


Lys 


Gly Leu 


Asp 


Glu 


Gin 


Pro 








200 


Phe 


Asn 


Cys 


Thr 






215 




Leu 


Lys 


Gly Glu 




230 






Pro 


He 


Cys 


Gly 


245 








Ala 


Leu Gly Lys 


Glu 


Lys 


Pro 


Phe 








280 


Tyr 


Cys 


Glu 


Leu 






295 





Cys Thr Lys Ala 
10 

Leu Gly Trp Arg 
25 

Asn Pro Gly Leu 

Asp Ala Leu Ala 
60 

Pro Ala Glu Arg 
75 



Cys 


Phe Val 


Cys 




90 




Tyr Glu Phe 


Glu 


105 






Phe 


Ala Pro 


Cys 


He 


Lys Ala 


Met 






14 0 


Leu 


Cys Asp 


Val 




155 




Arg 


His Leu 


Cys 




170 




Gly Lys Tyr 


He 


185 






Leu 


Met Phe 


Arg 


His 


Cys Gly 


Lys 






220 


Leu 


Tyr Cys 


Leu 




235 




Ala 


Cys Arg 


Arg 




250 




Gin 


Trp His 


Val 


265 






Leu Gly His 


Arg 



Ser Pro Cys Ser 
15 

Leu Gin Gin Gin 
30 

Gly Val Ala Ala 
45 

Asn Ala Val Cys 

He Val Asn Ser 
80 

Ala Gin Cys Phe 
95 

Gly Arg Lys Tyr 
110 

Cys Gly Ser Cys 
125 

Asn Asn Asn Trp 

Glu Leu Ala Asp 
160 

Arg Pro Cys His 
175 

Cys Gin Arg Cys 
190- 

Ser Asp Ala Tyr 
205 

Glu Leu Thr Ala 

Pro Cys His Asp 
240 

Pro He Glu Gly 
255 

Glu His Phe Val 
270 

His Tyr Glu Lys 
285 



<210> 103 
<211> 500 
<212> PRT 

<213> Homo sapiens 



<400> 103 

Met Gly He Gly Leu Ser Ala Gin 

1 5 
Gly Trp Asp Lys His Ser Tyr Gly 



Gly Val Asn Met Asn Arg Leu Pro 

10 15 
Tyr His Gly Asp Asp Gly His Ser 
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20 



25 



• 30 



Phe Cys Ser Ser Gly Thr Gly Gin Pro Tyr Gly Pro Thr Phe Thr Thr 

35 40 . 45 

Gly Asp Val He Gly Cys Cys Val Asn Leu He Asn Asn Thr Cys Phe 

50 55 60 

Tyr Thr Lys Asn Gly His Ser Leu Gly He Ala Phe Thr Asp Leu Pro 
65 70 75 80 

Pro Asn Leu Tyr Pro Thr Val Gly Leu Gin Thr Pro Gly Glu Val Val 

85 90 95 

Asp Ala Asn Phe Gly Gin His Pro Phe Val Phe Asp He Glu Asp Tyr 

100 ' 105 110 

Met Arg Glu Trp Arg Thr Lys He Gin Ala Gin He Asp Arg Phe Pro 

115 120 125 

He Gly Asp Arg Glu Gly Glu Trp Gin Thr Met He Gin Lys Met Val 

13 0 135 140 

Ser Ser Tyr Leu Val His His Gly Tyr Cys Ala Thr Ala Glu Ala Phe 
145 150 155 160 

Ala Arg Ser Thr Asp Gin Thr Val Leu Glu Glu Leu Ala Ser He Lys 

165 170 175 

Asn Arg Gin Arg He Gin Lys Leu Val Leu Ala Gly Arg Met Gly Glu 

180 185 190 

Ala He Glu Thr Thr Gin Gin Leu Tyr Pro Ser Leu Leu Glu Arg Asn 

195 200- 205 

Pro Asn Leu Leu Phe Thr Leu Lys Val Arg Gin Phe He Glu Met Val 

210 215 220 

Asn Gly Thr Asp Ser Glu Val Arg Cys Leu Gly Gly Arg Ser Pro Lys 
225 230 235 240 

Ser Gin Asp Ser Tyr Pro Val Ser Pro Arg Pro Phe Ser Ser Pro Ser 

245 250 255 

Met Ser Pro Ser His Gly Met Asn He His Asn Leu Ala Ser Gly Lys 

260 265 270 

Gly Ser Thr Ala His Phe Ser Gly Phe Glu Ser Cys Ser Asn Gly Val 

275 280 285 

He Ser Asn Lys Ala His Gin Ser Tyr Cys His Ser Asn Lys His Gin 

290 295 300 

Ser Ser Asn Leu Asn Val Pro Glu Leu Asn Ser He Asn Met Ser Arg 
305 310 315 320 

Ser Gin Gin Val Asn Asn Phe Thr Ser Asn Asp Val Asp Met Glu Thr 

325 330 335 

Asp His Tyr Ser Asn Gly Val Gly Glu Thr Ser Ser Asn Gly Phe Leu 

340 345 350 

Asn Gly Ser Ser Lys His Asp His Glu Met Glu Asp Cys Asp Thr Glu 

355 360 , 365 

Met Glu Val Asp Ser Ser Gin Leu Arg Arg Gin Leu Cys Gly Gly Ser 

370 375 380 

Gin Ala Ala He Glu Arg Met He His Phe Gly Arg Glu Leu Gin Ala 
385 390 395 400 

Met Ser Glu Gin Leu Arg Arg Asp Cys Gly Lys Asn Thr Ala Ash Lys 

405 410 415 

Lys Met Leu Lys Asp Ala Phe Ser Leu Leu Ala Tyr Ser Asp Pro Trp 

420 425 430 

Asn Ser Pro Val Gly Asn Gin Leu Asp Pro He Gin Arg Glu Pro Val 



435 



440 



445 
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Cys Ser Ala Leu Asn Ser Ala lie Leu Glu Thr His Asn Leu Pro Lys 

450 455 460 

Gin Pro Pro Leu Ala Leu Ala Met Gly Gin Ala Thr Gin Cys Leu Gly 
465 470 475 480 

Leu Met Ala Arg Ser Gly lie Gly Ser Cys Ala Phe Ala Thr Val Glu 
485 490 495 

Asp Tyr Leu. His 



<210> 104 

<211> 387 

<212> PRT 

<213> Homo sapiens 



<400> 104 



Met 


Ala 


Thr 


Ser 


Gly Val 


Leu 


Pro Gly Gly 


Gly 


Phe 


Val 


Ala 


Ser 


Ala 


1 








5 






10 










15 




Ala 


Ala 


Val 


Ala Gly Pro 


Glu 


Met Gin Thr 


Gly Arg Asn Asn Phe 


Val 








20 








25 








30 






lie 


Arg 


Arg 


Asn 


Pro 


Ala 


Asp 


Pro Gin Arg 


He 


Pro 


Ser 


Asn 


Pro 


Ser 






35 










40 






45 








His 


Arg 


He 


Gin 


Cys 


Ala 


Ala Gly Tyr Glu 


Gin 


Ser 


Glu 


His 


Asn 


Val 




50 










55 






60 










Cys 


Gin 


Asp 


He 


Asp 


Glu 


Cys 


Thr Ala Gly 


Thr 


His 


Asn 


Cys 


Arg 


Ala 


65 










70 






75 










80 


Asp 


Gin 


Val 


Cys 


lie 


Asn 


Leu Arg Gly Ser 


Phe 


Ala 


Cys 


Gin 


Cys 


Pro 










85 






90 










95 




Pro 


Gly 


Tyr 


Gin 


Lys 


Arg 


Gly Glu Gin Cys 


Val 


Asp 


He 


Asp 


Glu 


Cys 








100 








105 








110 






Thr 


lie. 


Pro 


Pro 


Tyr 


Cys 


His 


Gin Arg Cys 


Val 


Asn 


Thr 


Pro Gly 


Ser 






115 










120 






125 








Phe 


Tyr 


Cys 


Gin 


Cys 


Ser 


Pro 


Gly Phe Gin 


Leu 


Ala 


Ala 


Asn 


Asn 


Tyr 




130 










135 






140 










Thr 


Cys 


Val 


Asp 


He 


Asn 


Glu 


Cys Asp Ala 


Ser 


Asn 


Gin 


Cys 


Ala 


Gin 


145 










150 






155 










160 


Gin 


Cys 


Tyr 


Asn 


He 


Leu 


Gly 


Ser Phe Tie 


Cys 


Gin 


Cys 


Asn 


Gin 


Gly 










165 






170 










175 




Tyr 


Glu 


Leu 


Ser 


Ser Asp 


Arg 


Leu Asn Cys 


Glu Asp 


He 


Asp 


Glu 


Cys 








180 








185 








190 






Arg 


Thr 


Ser 


Ser 


Tyr 


Leu 


Cys 


Gin Tyr Gin 


Cys 


Val 


Asn 


Glu 


Pro 


Gly 






195 










200 






205 








Lys 


Phe 


Ser 


Cys 


Met 


Cys 


Pro Gin Gly Tyr 


Gin 


Val 


Val 


Arg 


Ser 


Arg 




210 










215 






220 










Thr 


Cys 


Gin 


Asp 


He 


Asn 


Glu 


Cys Glu Thr 


Thr 


Asn 


Glu 


Cys 


Arg 


Glu 


225 










230 






235 










240 


Asp 


Glu 


Met 


Cys 


Trp 


Asn 


Tyr 


His Gly Gly 


Phe 


Arg 


Cys 


Tyr 


Pro 


Arg 










245 






250 










255 




Asn 


Pro 


Cys 


Gin Asp 


Pro 


Tyr 


He Leu Thr 


Pro 


Glu 


Asn 


Arg 


Cys 


Val 








260 








265 








270 






Cys 


Pro 


Val 


Ser 


Asn 


Ala 


Met 


Cys Arg Glu 


Leu 


Pro 


Gin 


Ser 


He 


Val 






275 










280 






285 








Tyr 


Lys 


Tyr 


Met 


Ser 


He 


Arg 


Ser Asp Arg 


Ser 


Val 


Pro 


Ser 


Asp 


He 



290 295 300 



500 
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Phe Gin 


He 


Gin 


Ala 


Thr 


Thr 


He Tyr Ala Asn Thr He 


Asn Thr Phe 


305 








310 




315 


320 


Arg He 


Lys 


Ser 


Gly Asn Glu 


Asn Gly Glu Phe Tyr Leu Arg Gin Thr 








325 






330 


335 


Ser Pro 


Val 


Ser 


Ala 


Met 


Leu 


Val Leu Val Lys Ser Leu 


Ser Gly Pro 






340 








345 


350 


Arg Glu 


His 


He 


Val 


Asp 


Leu 


Glu Met Leu Thr Val Ser 


Ser He Gly 




355 










360 365 


Thr Phe 


Arg 


Thr 


Ser 


Ser 


Val 


Leu Arg Leu Thr He lie 


Val Gly Pro 


370 










375 


380 


Phe Ser 


Phe 














385 
















<210> 105 














<211> 531 














<212> PRT 














<213> Homo ! 


sapiens 










<400> 105 














Met Ser 


Lys 


Pro 


til S 


Ser 


Glu 


Ala Gly Thr Ala Phe He 


Gin Thr Gin 


1 






5 






10 


15 


Gin Leu 


His 


Ala 


Ala 


Met 


Ala 


Asp Thr Phe Leu Glu His 


Met Cys Arg 






20 








25 


30 


Leu Asp He Asp 


Ser 


Pro 


Pro 


He Thr Ala Arg Asn Thr Gly He He 




35 










40 45 




Cys Thr 


He 


Gly 


Pro 


Ala 


Ser Arg Ser Val Glu Thr Leu 


Lys Glu Met 


50 










55 


60 




He Lys 


Ser 


Gly 


Met 


Asn 


Val 


Ala Arg Leu Asn Phe Ser 


His Gly Thr 


65 








70 




75 


80 


His Glu 


Tyr 


His 


Ala 


Glu 


Thr 


He Lys Asn Val Arg Thr 


Ala Thr Glu 








85 






90 


95 


Ser Phe 


Ala 


Ser 


Asp 


Pro 


Tyr 


Leu Tyr Arg Pro Val Ala 


Val Ala Leu 






100 








105 


110 


Asp Thr 


Lys 


Gly 


Pro 


Glu 


He Arg Thr Gly Leu He Lys 


Gly Ser Gly 




115 










120 125 




Thr Ala 


Glu 


Leu 


Glu 


Leu 


Lys 


Lys Gly Ala Thr Leu Lys 


He Thr Leu 


130 










135 


140 




Asp Asn 


Ala 


Tyr 


Met 


Glu 


Lys 


Cys Asp Glu Asn He Leu 


Trp Leu Asp 


145 








150 




155 


160 


Tyr Lys 


Asn 


He 


Cys 


Lys 


Val 


Val Glu Val Gly Ser Lys 


He Tyr Val 








165 






170 


175 


Asp Asp Gly Leu 


He 


Ser 


Leu 


Gin Val Lys Gin Lys Gly Ala Asp Phe 






180 








185 


190 


Leu Val 


Thr 


Glu 


Val 


Glu 


Asn 


Gly Gly Ser Leu Gly Ser Lys Lys Gly 




195 










200 205 




Val Asn 


Leu 


Pro 


Gly Ala Ala Val Asp Leu Pro Ala Val 


Ser Glu Lys 


210 










215 


220 




Asp He 


Gin Asp 


Leu 


Lys 


Phe 


Gly Val Glu Gin Asp Val Asp Met Val 


225 








230 




235 


240 


Phe Ala 


Ser 


Phe 


He 


Arg 


Lys 


Ala Ser Asp Val His Glu 


Val Arg Lys 








245' 






250 


255 


Val Leu 


Gly Glu 


Lys 


Gly Lys 


Asn lie Lys He He Ser Lys He Glu 






260 








265 


270 
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Asn 


His 


Glu 


Gly 


Val 


Arg 


Arg 


Phe 


Asp 


Glu 


He 


Leu 


Glu 


Ala 


Ser 


Asp 






275 










280 










285 








Gly 


He 


Met 


Val 


Ala 


Arg 


Gly Asp 


Leu 


Gly 


He 


Glu 


He 


Pro 


Ala 


Glu 




290 










295 










3 0.0 










Lys 


Val 


Phe 


Leu 


Ala 


Gin 


Lys 


Met 


Met 


He 


Gly Arg Cys 


Asn Arg 


Ala 


305 










310 










315 










320 


Gly 


Lys 


Pro 


Val 


He 


Cys 


Ala 


Thr 


Gin 


Met 


Leu 


Glu 


Ser 


Met 


He 


Lys 










325 










330 










335 




Lys 


Pro 


Arg 


Pro 


Thr 


Arg 


Ala 


Glu 


Gly 


Ser 


Asp 


Val 


Ala 


Asn 


Ala 


Val 








340 










345 










350 






Leu 


Asp 


Gly 


Ala 


Asp 


Cys 


He 


Met 


Leu 


Ser 


Gly 


Glu 


Thr 


Ala 


Lys 


Gly 






355 










360 










365 








Asp 


Tyr 


Pro 


Leu 


Glu 


Ala 


Val 


Arg 


Met 


Gin 


His 


Leu 


He 


Ala 


Arg 


Glu 




370 










375 










380 










Ala 


Glu 


Ala 


Ala 


He 


Tyr 


His 


Leu 


Gin 


Leu 


Phe 


Glu 


Glu 


Leu 


Arg 


Arg 


385 










390 










395 










400 


Leu 


Ala 


Pro 


lie 


Thr 


Ser 


Asp 


Pro 


Thr 


Glu 


Ala 


Thr 


Ala 


Val 


Gly 


Ala 










405 










410 










415 




Val 


Glu 


Ala 


Ser 


Phe 


Lys 


Cys 


Cys 


Ser Gly 


Ala 


He 


He 


Val 


Leu 


Thr 








420 










425 










430 






Lys 


Ser 


Gly 


Arg 


Ser 


Ala 


His. 


Gin 


Val 


Ala 


Arg 


Tyr Arg 


Pro 


Arg 


Ala 






435 










440 










445 








Pro 


He 


He 


Ala 


Val 


Thr 


Arg Asn 


Pro 


Gin 


Thr 


Ala Arg 


Gin 


Ala 


His 




450 










455 










460 










Leu 


Tyr 


Arg 


Gly 


He 


Phe 


Pro 


Val 


Leu 


Cys 


Lys 


Asp 


Pro 


Val 


Gin 


Glu 


465 










470 










475 










480 


Ala 


Trp 


Ala 


Glu 


Asp 


Val Asp Leu Arg Val 


Asn 


Phe 


Ala 


Met 


Asn 


Val 










4 85 










490 










*± y o 




Gly 


Lys 


Ala 


Arg 


Gly 


Phe 


Phe 


Lys 


Lys 


Gly 


Asp 


Val 


Val 


He 


Val 


Leu 








500 










505 










510 






Thr 


Gly 


Trp 


Arg 


Pro 


Gly 


Ser Gly 


Phe 


Thr 


Asn 


Thr 


Met 


Arg 


Val 


Val 






515 










520 










525 








Pro 


Val 


Pro 






























530 






























<210> 106 




























<211> 480 




























<212> PRT 




























<213> Homo sapiens 
























<400> 106 




























Met 


Ala 


Ala 


Arg 


Cys 


Ser 


Thr 


Arg 


Trp 


Leu 


Leu 


Val 


Val 


Val 


Gly 


Thr 


1 








5 










10 










15 




Pro 


Arg 


Leu 


Pro 


Ala 


He 


Ser Gly Arg 


Gly 


Ala 


Arg 


Pro 


Pro 


Arg 


Glu 








20 










25 










30 






Gly 


Val 


Val 


Gly 


Ala 


Trp 


Leu 


Ser 


Arg 


Lys 


Leu 


Ser 


Val 


Pro 


Ala 


Phe 






35 










40 










45 








Ala 


Ser 


Ser 


Leu 


Thr 


Ser Cys Gly Pro Arg 


Ala 


Leu 


Leu 


Thr 


Leu 


Arg 




50 










55 










60 










Pro 


Gly Val 


Ser 


Leu 


Thr Gly Thr 


Lys 


His 


Asn 


Pro 


Phe 


He 


Cys 


Thr 


65 










70 










75 










80 


Ala 


Ser 


Phe 


His 


Thr 


Ser 


Ala 


Pro 


Leu 


Ala 


Lys 


Glu Asp 


Tyr 


Tyr 


Gin 










85 










90 










95 





BNSDOCID: <WO 0177327A1J_> 



WO 01/77327 PCT/US00/16951 

264 

lie Leu Gly Val Pro Arg Asn Ala Ser Gin Lys Glu lie Lys Lys Ala 

100 105 110 

Tyr Tyr Gin Leu Ala Lys Lys Tyr His Pro Asp Thr Asn Lys Asp Asp 

115 120 125 

Pro Lys Ala Lys Glu Lys Phe Ser Gin Leu Ala Glu Ala Tyr Glu Val 

130 135 140 

Leu Ser Asp Glu Val Lys Arg Lys Gin Tyr Asp Ala Tyr Gly Ser Ala 
14 5 150 155 160 

Gly Phe Asp Pro Gly Ala Ser Gly Ser Gin His Ser Tyr Trp Lys Gly 

165 170 175 

Gly Pro Thr Val Asp Pro Glu Glu Leu Phe Arg Lys lie Phe Gly Glu 

180 185 190 

Phe Ser Ser Ser Ser Phe Gly Asp Phe Gin Thr Val Phe Asp Gin Pro 

195 200 205 

Gin Glu Tyr Phe Met Glu Leu Thr Phe Asn Gin Ala Ala Lys Gly Val 

210 215 220 

Asn Lys Glu Phe Thr Val Asn lie Met Asp Thr c Cys Glu Arg Cys Asn 
225 230 235 " ~ 240 

Gly Lys Gly Asn Glu Pro Gly Thr Lys Val Gin His Cys His Tyr Cys 

245 250 255 

Gly Gly Ser Gly Met Glu Thr lie Asn Thr Gly Pro Phe Val Met Arg 

260 265 270 

Ser Thr Cys Arg Arg Cys Gly Gly Arg Gly Ser lie lie lie Ser Pro 

275 280 285 

Cys Val Val Cys Arg Gly Ala Gly Gin Ala Lys Gin Lys Lys Arg Val 

290 295 300 

Met lie Pro Val Pro Ala Gly Val Glu Asp Gly Gin Thr Val Arg Met 
305 310 315 * 320 

Pro Val Gly Lys Arg Glu lie Phe He Thr Phe Arg Val Gin Lys Ser 

325 330 335 

Pro Val Phe Arg Arg Asp Gly Ala Asp lie His Ser Asp Leu Phe He 

340 345 350 

Ser He Ala Gin Ala Leu Leu Gly Gly Thr Ala Arg, Ala Gin Gly Leu 

355 360 365 

Tyr Glu Thr lie Asn Val Thr He Pro Pro Gly Thr Gin Thr Asp Gin 

370 375 380 

Lys He Arg Met Gly Gly Lys Gly lie Pro Arg He Asn Ser Tyr Gly 
385 390 395 400 

Tyr Gly Asp His Tyr He His He Lys He Arg Val Pro Lys Arg Leu 

405 410 415 

Thr Ser Arg Gin Gin Ser Leu He Leu Ser Tyr Ala Glu Asp Glu Thr 

420 425 430 

Asp Val Glu Gly Thr Val Asn Gly Val Thr Leu Thr Ser Ser Gly Gly 

435 440 445 

Ser Thr Met Asp Ser Ser Ala Gly Ser Lys Ala Arg Arg Glu Ala Gly 

450 455 460 

Glu Asp Glu Glu Gly Phe Leu Ser Lys Leu Lys Lys Met Phe Thr Ser 
465 470 475 480 

<210> 107 
<211> 572 
<212> PRT 
<213> Homo sapiens 
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<400> 107 

Met Ala Ala Pro Arg Pro Ser Pro Ala lie Ser Val Ser Val Ser Ala 

15 10 15 

Pro Ala Phe Tyr Ala Pro Gin Lys Lys Phe Gly Pro Val Val Ala Pro 

20 25 30 

Lys Pro Lys Val Asn Pro Phe Arg Pro Gly Asp Ser Glu Pro Pro Pro 

35 40 45 

Ala Pro Gly Ala Gin Arg Ala Gin Met Gly Arg Val Gly Glu lie Pro 

50 55 60 

Pro Pro Pro Pro Glu Asp Phe Pro Leu Pro Pro Pro Pro Leu Ala Gly 
65 70 75 80 

Asp Gly Asp Asp Ala Glu Gly Ala Leu Gly Gly Ala Phe Pro Pro Pro 

85 90 95 

Pro Pro Pro lie Glu Glu Ser Phe Pro Pro Ala Pro Leu Glu Glu Glu 

100 105 110 

lie Phe Pro Ser Pro Pro Pro Pro Pro Glu Glu Glu Gly Gly Pro Glu 

115 120 125 

Ala Pro lie Pro Pro Pro Pro Gin Pro Arg Glu Lys Val Ser Ser lie 

130 . 135 140 - 

Asp Leu Glu lie Asp Ser Leu Ser Ser Leu Leu Asp Asp Met Thr Lys 
145 150 155 160 

Asn Asp Pro Phe Lys Ala Arg Val Ser Ser Gly Tyr Val Pro Pro Pro 

165 170 175 

Val Ala Thr Pro Phe Ser Ser Lys Ser Ser Thr Lys Pro Ala Ala Gly 

180 185 190 

Gly Thr Ala Pro Leu Pro Pro Trp Lys Ser Pro Ser Ser Ser Gin Pro 

195 200 205 

Leu Pro Gin Val Pro Ala Pro Ala Gin Ser Gin Thr Gin Phe His Val 

210 - 215 220 

Gin Pro Gin Pro Gin Pro Lys Pro Gin Val Gin Leu His Val Gin Ser 
225 230 235 240 

Gin Thr Gin Pro Val Ser Leu Ala Asn Thr Gin Pro Arg Gly Pro Pro 

245 250 255 

Ala Ser Ser Pro Ala Pro Ala Pro Lys Phe Ser Pro Val Thr Pro Lys 

260 265 270 

Phe Thr Pro Val Ala Ser Lys Phe Ser Pro Gly Ala Pro Gly Gly Ser 

275 280 285 

Gly Ser Gin Pro Asn Gin Lys Leu Gly His Pro Glu Ala Leu Ser Ala 

290 , 295 300 

Gly Thr Gly Ser Pro Gin Pro Pro Ser Phe Thr Tyr Ala Gin Gin Arg 
305 310 315 320 

Glu Lys Pro Arg Val Gin Glu Lys Gin His Pro Val Pro Pro Pro Ala 

325 330 335 

Gin Asn Gin Asn Gin Val Arg Ser Pro Gly Ala Pro Gly Pro Leu Thr 

340 345 350 

Leu Lys Glu Val Glu Glu Leu Glu Gin Leu Thr Gin Gin Leu Met Gin 

355 . 360 365 

Asp Met Glu His Pro Gin Arg Gin Asn Val Ala Val Asn Glu Leu Cys 

370 375 380 

Gly Arg Cys His Gin Pro Leu Ala Arg Ala Gin Pro Ala Val Arg Ala 
385 390 395 400 

Leu Gly Gin Leu Phe His lie Ala Cys Phe Thr Cys His Gin Cys Ala 



405 



410 



415 
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Gin 


Gin 


Leu 


Gin Gly Gin 








420 


Cys 


Glu 


Gly 


Cys Tyr Thr 






435 




Glu 


Pro 


lie 


Thr Asp Arg 




450 






Pro 


His 


Cys 


Phe Thr Cys 


465 






470 


Ser 


Phe 


lie 


Val Asp Gin 








485 


His 


Lys 


Gin 


Tyr Ala Pro 








500 


Pro 


Glu 


Pro 


Gly Arg Asp 






515 




Asn 


Phe 


His 


Met Lys Cys 




530 






Ser 


lie 


Glu 


Ala Asp Asp 


545 






550 


Leu 


Cys 


Arg 


Lys Cys His 








565 



<210> 108 








<211> 2861 








<212> PRT 








<213> Homo 


sapiens 




<400> 108 








Met Lys Ala Met 


Asp Val 


1 




5 




Leu Ser Gly Gly 


Arg Asp 




20 






Ala Arg Ser 


Asn 


His 


Asp 


35 








lie Ser Tyr 


Leu 


Ala 


Cys 


50 








Phe Thr Val 


He 


Val 


Asp 


65 






70 


Pro Leu Leu 


Lys 


He 


Leu 






85 




Ala Leu He 


He 


Lys 


Pro 




100 






Phe Gly Ser 


Ser 


Lys 


Phe 


115 








Gly Leu Thr 


Lys 


Val 


Val 


130 








Gly Cys Leu 


Glu 


Tyr 


Asn 


145 






150 


Phe Glu Asp 


Tyr 


He 


Ser 






165 




Glu Leu Gin 


Asp 


He 


Leu 




180 






Gly Ala Arg Asn 


Met 


He 



195 



266 

Gin Phe Tyr Ser Leu Glu 
425 

Asp Thr Leu Glu Lys Cys 
440 

Met Leu Arg Ala Thr Gly 
455 460 
Val Val Cys Ala Arg Pro 
475 

Ala Asn Arg Pro His Cys 
490 

Arg Cys Ser Val Cys Ser 
505 

Glu Thr Val Arg Val Val 
520 

Tyr Lys Cys Glu Asp Cys 
535 540 
Asn Gly Cys Phe Pro Leu 
555 

Thr Ala Arg Ala Gin Thr 
570 



Leu Pro He Leu Lys Glu 
10 

Lys Arg Gly Gly Pro He 
25 

Arg He Arg Gin Glu Asp 
40 

He Pro Ser .Glu Glu Val 
55 60 
Met Arg Gly Ser Lys Trp 
75 

Gin Glu Ser Phe Pro Cys 
90 

Asp Asn Phe Trp Gin Lys 
105 

Glu Phe Glu Thr Asn Met 
120 

Asp Pro Ser Gin Leu Thr 
135 140 
His Glu Glu Trp He Glu 
155 

Asn Ala Thr His Met Leu 
170 

Ala Lys Lys Glu Leu Pro 
185 

Glu Glu His Ser Gin Leu 
200 
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Gly Ala Pro Tyr 
430 

Asn Thr Cys Gly 
445 

Lys Ala Tyr His 

Leu Glu Gly Thr 
480 

Val Pro Asp Tyr 
495 

Glu Pro He Met 
510 

Ala Leu Asp Lys 
525 

Gly Lys Pro Leu 

Asp Gly His Val 
560 



Lys Val Ala Tyr 
■15 

Leu Thr Phe Pro 
30 

Leu Arg Arg Leu 
45 

Cys Lys Arg Gly 

Asp Ser He Lys 
80 

Cys He His Val 
95 

Gin Arg Thr Asn 
110 

Val Ser Leu Glu 
125 

Pro Glu Phe Asp 

He Arg Val Ala 
160 

Ser Arg Leu Glu 
175 

Gin Asp Leu Glu 
190 

Lys Lys Lys Val 
205 



8NSDOCI D: <WO 01 77327A1 J_> 




WO 01/77327 PCT/US00/16951 

267 



He 


Lys 


Ala 


Pro 


He 


Glu 


Asp 


Leu 


Asp 


Leu 


Glu 


Gly Gin Lys 


Leu 


Leu 




210 










215 










220 










Gin 


Arg 


He 


Gin 


Ser 


Ser 


Glu 


Ser 


Phe 


Pro 


Lys 


Lys 


Asn 


Ser 


Gly 


Ser 


225 










230 










235 










240 


Gly 


Asn 


Ala 


Asp 


Leu 


Gin 


Asn 


Leu 


Leu 


Pro 


Lys 


Val 


Ser 


Thr 


Met 


Leu 










245 










250 










255 




Asp 


Arg 


Leu 


His 


Ser 


Thr 


Arg 


Gin 


His 


Leu 


His 


Gin 


Met 


Trp 


His 


Val 








260 










265 










270 






Arg 


Lys 


Leu 


Lys 


Leu 


Asp 


Gin 


Cys 


Phe 


Gin 


Leu 


Arg 


Leu 


Phe 


Glu 


Gin 






275 










280 










285 








Asp 


Ala 


Glu 


Lys 


Met 


Phe 


Asp 


Trp 


He 


Thr 


His 


Asn 


Lys 


Gly 


Leu 


Phe 




290 










295 










300 










Leu 


Asn 


Ser 


Tyr 


Thr 


Glu 


He 


Gly 


Thr 


Ser 


His 


Pro 


His 


Ala 


Met 


Glu 


305 










310 










315 










320 


Leu 


Gin 


Thr 


Gin 


His 


Asn 


His 


Phe 


Ala 


Met 


Asn 


Cys 


Met 


Asn 


Val 


Tyr 










325 










330 










335 




Val 


Asn 


He 


Asn 


Arg 


He 


Met 


Ser 


Val 


Ala 


Asn 


Arg 


Leu 


Val 


Glu 


Ser 








340 










345 










350 






Gly 


His 


Tyr 


Ala 


Ser 


Gin 


Gin 


He 


Arg 


Gin 


He 


Ala 


Ser 


Gin 


Leu 


Glu 






355 










3 60 










365 








Gin 


Glu 


Trp 


Lys 


Ala 


Phe 


Ala 


Ala 


Ala Leu Asp 


Glu 


Arg 


Ser 


Thr 


Leu 




370 










375 










380 










Leu 


Asp 


Met 


Ser 


Ser 


He 


Phe 


His 


Gin 


Lys 


Ala 


Glu 


Lys 


Tyr 


Met 


Ser 


385 










390 










395 










400 


Asn 


Val 


Asp 


Ser 


Trp 


Cys 


Lys 


Ala 


Cys 


Gly Glu 


Val 


Asp 


Leu 


Pro 


Ser 










405 










410 










415 




Glu 


Leu 


Gin 


Asp 


Leu 


Glu 


Asp 


Ala 


He 


His 


His 


His 


Gin 


Gly 


He 


Tyr 








420 










425 










430 






Glu 


His 


He 


Thr 


Leu 


Ala 


Tyr 


Ser 


Glu 


Val 


Ser 


Gin Asp Gly 


Lys 


Ser 






435 










440 










445* 








Leu 


Leu 


Asp 


Lys 


Leu 


Gin 


Arg 


Pro 


Leu 


Thr 


Pro 


Gly 


Ser 


Ser 


Asp 


Ser 




450 










455 










460 










Leu 


Thr 


Ala 


Ser 


Ala 


Asn 


Tyr 


Ser 


Lys 


Ala 


Val 


His 


His 


Val 


Leu 


Asp 


465 










470 










475 










480 


Val 


He 


His 


Glu 


Val 


Leu 


His 


His 


Gin Arg His 


Val 


Arg 


Thr 


He 


Trp 










485 










490 










495 




Gin 


His 


Arg 


Lys 


Val 


Arg 


Leu 


His 


Gin Arg Leu 


Gin 


Leu 


Cys 


Val 


Phe 








500 










505 










510 






Gin 


Gin 


Glu 


Val 


Gin 


Gin 


Val 


Leu 


Asp 


Trp 


He 


Glu 


Asn 


His 


Gly 


Glu 






515 










520 










525 








Ala 


Phe 


Leu 


Ser 


Lys 


His 


Thr Gly 


Val 


Gly Lys 


Ser 


Leu 


His 


Arg 


Ala 




530 










535 










540 










Arg 


Ala 


Leu 


Gin 


Lys 


Arg 


His 


Glu 


Asp 


Phe 


Glu 


Glu 


Val 


Ala 


Gin 


Asn 


545 










550 










555 










560 


Thr 


Tyr 


Thr 


Asn 


Ala 


Asp 


Lys 


Leu 


Leu 


Glu 


Ala 


Ala 


Glu 


Gin 


Leu 


Ala 










565 










570 










575 




Gin 


Thr 


Gly 


Glu 


Cys 


Asp 


Pro 


Glu 


Glu 


He 


Tyr 


Gin 


Ala 


Ala 


His 


Gin 








580 










585 










590 






Leu 


Glu 


Asp 


Arg 


He 


Gin 


Asp 


Phe 


Val 


Arg Arg 


Val 


Glu 


Gin 


Arg 


Lys 






595 










600 










605 








He 


Leu 


Leu 


Asp 


Met 


Ser 


Val 


Ser 


Phe 


His 


Thr 


His 


Val 


Lys 


Glu 


Leu 




610 










615 










62 0 










Trp 


Thr 


Trp 


Leu 


Glu 


Glu 


Leu 


Gin 


Lys 


Glu 


Leu 


Leu 


Asp 


Asp 


Val 


Tyr 
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625 630 635 640 

Ala Glu Ser Val Glu Ala Val Gin Asp Leu lie Lys Arg Phe Gly Gin 

645 650 655 

Gin Gin Gin Thr Thr Leu Gin Val Thr Val Asn Val lie Lys Glu Gly 

660 665 670 

Glu Asp Leu He Gin Gin Leu Arg Asp Ser Ala He Ser Ser Asn Lys 

675 680 685 

Thr Pro His Asn Ser Ser He Asn His He Glu Thr Val Leu Gin Gin 

690 695 700 

Leu Asp Glu Ala Gin Ser Gin Met Glu Glu Leu Phe Gin Glu Arg Lys 
705 710 715 720 

He Lys Leu Glu Leu Phe Leu His Val Arg He Phe Glu Arg Asp Ala 

725 730 735 

He Asp He He Ser Asp Leu Glu Ser Trp Asn Asp Glu Leu Ser Gin 

740 745 750 

Gin Met Asn Asp Phe Asp Thr Glu Asp Leu Thr He Ala Glu Gin Arg 

755 760 765 

Leu Gin His His Ala Asp Lys Ala Leu Thr Met Asn Asn Leu Thr Phe 

770 775 780 

Asp Val He His Gin Gly Gin Asp Leu Leu Gin Tyr Val Asn Glu Val 
785 790 795 800 

Gin Ala Ser Gly Val Glu Leu Leu Cys Asp Arg Asp Val Asp Met Ala 

805 810 815 

Thr Arg Val Gin Asp Leu Leu Glu Phe Leu His Glu Lys Gin Gin Glu 

820 825 830 

Leu Asp Leu Ala Ala Glu Gin His Arg Lys His Leu Glu Gin Cys Val 

835 840 845 

Gin Leu Arg His Leu Gin Ala Glu Val Lys Gin Val Leu Gly Trp He 

850 855 860 

Arg Asn Gly Glu Ser Met Leu Asn Ala Gly Leu He Thr Ala Ser Ser 
865 870 875 880 

Leu Gin Glu Ala Glu Gin Leu Gin Arg Glu His Glu Gin Phe Gin His 

885 890 895 

Ala He Glu Lys Thr His Gin Ser Ala Leu Gin Val Gin Gin Lys Ala 

900 905 910 

Glu Ala Met Leu Gin Ala Asn His Tyr Asp Met Asp Met He Arg Asp 

915 920 925 

Cys Ala Glu Lys Val Ala Ser His Trp Gin Gin Leu Met Leu Lys Met 

930 935 940 

Glu Asp Arg Leu Lys Leu Val Asn Ala Ser Val Ala Phe Tyr Lys Thr 
945 950 955 " 960 

Ser Glu Gin Val Cys Ser Val Leu Glu Ser Leu Glu Gin Glu Tyr Lys 

965 970 975 

Arg Glu Glu Asp Trp Cys Gly Gly Ala Asp Lys Leu Gly Pro Asn Ser 

980 985 990 

Glu Thr Asp His Val Thr Pro Met He Ser Lys His Leu Glu Gin Lys 

995 1000 1005 

Glu Ala Phe Leu Lys Ala Cys Thr Leu Ala Arg Arg Asn Ala Asp Val 

1010 1015 1020 

Phe Leu Lys Tyr Leu His Arg Asn Ser Val Asn Met Pro Gly Met Val 
1025 , 1030 1035 1040 

Thr His He Lys Ala Pro Glu Gin Gin Val Lys Asn He Leu Asn Glu 
1045 1050 1055 
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Leu Phe Gin Arg Glu 
1060 



Asn Arg Val Leu His Tyr Trp Thr Met Arg 
1065 1070 



Lys 



Arg Arg Leu Asp Gin Cys Gin Gin Tyr Val Val Phe Glu Arg Ser Ala 

1075 1080 1085 

Lys Gin Ala Leu Glu Trp lie His Asp Asn Gly Glu Phe Tyr Leu Ser 

1090 1095 1100 

Thr His Thr Ser Thr Gly Ser Ser lie Gin His Thr. Gin Glu Leu Leu 
1105 1110 1115 112< 

Lys Glu His Glu Glu Phe Gin lie Thr Ala Lys Gin Thr Lys Glu Arg 

1125 1130 1135 

Val Lys Leu Leu He Gin Leu Ala Asp Gly Phe Cys Glu Lys Gly His 

1140 1145 1150 

Ala His Ala Ala Glu He Lys Lys Cys Val Thr Ala Val Asp Lys Arg 

1155 1160 1165 

Tyr Arg Asp Phe Ser Leu Arg Met Glu Lys Tyr Arg Thr Ser Leu Glu 

1170 1175 1180 

Lys Ala Leu Gly He Ser Ser Asp Ser Asn Lys Ser Ser Lys Ser Leu 
1185 1190 1195 120( 

Gin Leu Asp He He Pro Ala Ser He Pro Gly Ser Glu Val Lys Leu 

1205 1210 1215 

Arg Asp Ala Ala His Glu Leu Asn Glu Glu Lys Arg Lys Ser Ala Arg 

1220 1225 1230 

Arg Lys Glu Phe He Met Ala Glu Leu He Gin Thr Glu Lys Ala Tyr 

1235 1240 1245 

Val Arg Asp Leu Arg Glu Cys Met Asp Thr Tyr Leu Trp Glu Met Thr 

1250 1255 1260 

Ser Gly Val Glu Glu He Pro Pro Gly He Val Asn Lys Glu Leu He 
1265 1270 1275 128( 

He Phe Gly Asn Met Gin Glu He Tyr Glu Phe His Asn Asn He Phe 

1285 1290 1295 

Leu Lys Glu Leu Glu Lys Tyr Glu Gin Leu Pro Glu Asp Val Gly His 

1300 1305 1310 

Cys Phe Val Thr Trp Ala Asp Lys Phe Gin Met Tyr Val Thr Tyr Cys 

1315 1320 " 1325 

Lys Asn Lys Pro Asp Ser Thr Gin Leu He Leu Glu His Ala Gly Ser 

1330 1335 . 1340 

Tyr Phe Asp Glu He Gin Gin Arg His Gly Leu Ala Asn Ser He Ser 
1345 1350 1355 136C 

Ser Tyr Leu He Lys Pro Val Gin Arg He Thr Lys Tyr Gin Leu Leu 

1365 1370 1375 

Leu Lys Glu Leu Leu Thr Cys Cys Glu Glu Gly Lys Gly Glu He Lys 

1380 1385 1390 

Asp Gly Leu Glu Val Met Leu Ser Val Pro Lys Arg Ala Asn Asp Ala 

1395 1400 1405 

Met His Leu Ser Met Leu Glu Gly Phe Asp Glu Asn He Glu Ser Gin 

1410 1415 1420 

Gly Glu Leu He Leu Gin Glu Ser Phe Gin Val Trp Asp Pro Lys Thr 
1425 1430 1435 144C 

Leu He Arg Lys Gly Arg Glu Arg His Leu Phe Leu Phe Glu Met Ser 

1445 1450 ' 1455 

Leu Val Phe Ser Lys Glu Val Lys Asp Ser Ser Gly Arg Ser Lys Tyr 

1460 1465 1470 

Leu Tyr Lys Ser Lys Leu Phe Thr Ser Glu Leu Gly Val Thr Glu His 
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1475 1480 1485 

Val Glu Gly Asp Pro Cys Lys Phe Ala Leu Trp Val Gly Arg Thr Pro 

1490 1495 1500 

Thr Ser Asp Asn Lys lie Val Leu Lys Ala Ser Ser lie Glu Asn Lys 
1505 1510 1515 1520 

Gin Asp Trp He Lys His He Arg Glu Val He Gin Glu Arg Thr He 

1525 1530 1535 

His Leu Lys Gly Ala Leu Lys Glu Pro He His He Pro Lys Thr Ala 

1540 1545 1550 

Pro Ala Thr Arg Gin Lys Gly Arg Arg Asp Gly Glu Asp Leu Asp Ser 

1555 1560 1565 

Gin Gly Asp Gly Ser Ser Gin Pro Asp Thr He Ser He Ala Ser Arg 

1570 1575 1580 

Thr Ser Gin Asn Thr Leu Asp Ser Asp Lys Leu Ser Gly Gly Cys Glu 
1585 1590 1595 1600 

Leu Thr Val Val He His Asp Phe Thr Ala Cys Asn Ser Asn Glu Leu 

1605 1610 1615 

Thr He Arg Arg Gly Gin Thr Val Glu Val Leu Glu Arg Pro His Asp 

1620 . 1625 ■ .1630 

Lys Pro Asp Trp Cys Leu Val Arg Thr Thr Asp Arg Ser Pro Ala Ala 

1635 1640 1645 

Glu Gly Leu Val Pro Cys Gly Ser Leu Cys He Ala His Ser Arg Ser 

1650 1655 1660 

Ser Met Glu Met Glu Gly He Phe Asn His Lys Asp Ser Leu Ser Val 
1665 1670 1675 1680 

Ser Ser Asn Asp Ala Ser Pro Pro Ala Ser Val Ala Ser Leu Gin Pro 

1685 1690 1695 

His Met He Gly Ala Gin Ser Ser Pro Gly Pro Lys Arg Pro Gly Asn 

1700 1705 1710 

Thr Leu Arg Lys Trp Leu Thr Ser Pro Val Arg Arg Leu Ser Ser Gly 

1715 1720 1725 

Lys Ala Asp Gly His Val Lys Lys Leu Ala His Lys His Lys Lys Ser 

1730 1735 1740 

Arg Glu Val Arg Lys Ser Ala Asp Ala Gly Ser Gin Lys Asp Ser Asp 
1745 1750 1755 1760 

Asp Ser Ala Ala Thr Pro Gin Asp Glu Thr Val Glu Glu Arg Gly Arg 

1765 1770 1775 

Asn Glu Gly Leu Ser Ser Gly Thr Leu Ser Lys Ser Ser Ser Ser Gly 

1780 1785 1790 

Met Gin Ser Cys Gly Glu Glu Glu Gly Glu Glu Gly Ala Asp Ala Val 

1795 1800 1805 

Pro Leu Pro Pro Pro Met Ala He Gin Gin His Ser Leu Leu Gin Pro 

1810 1815 1820 

Asp Ser Gin Asp Asp Lys Ala Ser Ser Arg Leu Leu Val Arg Pro Thr 
1825 1830 1835 1840 

Ser Ser Glu Thr Pro Ser Ala Ala Glu Leu Val Ser Ala He Glu Glu 

1845 1850 1855 

Leu Val Lys Ser Lys Met Ala Leu Glu Asp Arg Pro Ser Ser Leu Leu 

1860 1865 1870 

Val Asp Gin Gly Asp Ser Ser Ser Pro Ser Phe Asn Pro Ser Asp Asn 

1875 1880 1885 

Ser Leu Leu Ser Ser Ser Ser Pro lie Asp Glu Met Glu Glu Arg Lys 
1890 1895 1900 
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Ser Ser Ser Leu Lys Arg Arg His Tyr Val Leu Gin Glu Leu Val Glu 
1905 1910 1915 1920 

Thr Glu Arg Asp Tyr Val Arg Asp Leu Gly Tyr Val Val Glu Gly Tyr 

1925 1930 1935 

Met Ala Leu Met Lys Glu Asp Gly Val Pro Asp Asp Met Lys Gly Lys 

1940 1945 1950 

Asp Lys lie Val Phe Gly Asn lie His Gin lie Tyr Asp Trp His Arg 

1955 1960 1965 

Asp Phe Phe Leu Gly Glu Leu Glu Lys Cys Leu Glu Asp Pro Glu Lys 

1970 1975 1980 

Leu Gly Ser Leu Phe Val Lys His Glu Arg Arg Leu His Met Tyr lie 
1985 1990 1995 2000 

Ala Tyr Cys Gin Asn Lys Pro Lys Ser Glu His lie Val Ser Glu Tyr 

2005 2010 2015 

lie Asp Thr Phe Phe Glu Asp Leu Lys Gin Arg Leu Gly His Arg Leu 

2020 2025 2030 

Gin Leu Thr Asp Leu Leu lie Lys Pro Val. Gin Arg lie Met Lys Tyr 

2035 2040 2045 

Gin Leu Leu Leu Lys Asp Phe Leu Lys Tyr Ser Lys Lys Ala Ser Leu 

2050 2055 2060 

Asp Thr Ser Glu Leu Glu Arg Ala Val Glu Val Met Cys He Val Pro 
2065 2070 2075 2080 

Arg Arg Cys Asn Asp Met Met Asn Val Gly Arg Leu Gin Gly Phe Asp 

2085 2090 2095 

Gly Lys He Val Ala Gin Gly Lys Leu Leu Leu Gin Asp Thr Phe Leu 

2100 2105 2110 

Val Thr Asp Gin Asp Ala Gly Leu Leu Pro Arg Cys Arg Glu Arg Arg 

2115 2120 2125 

He Phe Leu Phe Glu Gin He Val He Phe Ser Glu Pro Leu Asp Lys 

2130 2135 2140 

Lys Lys Gly Phe Ser Met Pro Gly Phe Leu Phe Lys Asn Ser He Lys 
2145 2150 2155 2160 

Val Ser Cys Leu Cys Leu Glu Glu Asn Val Glu Asn Asp Pro Cys Lys 

2165 2170 2175 

Phe Ala Leu Thr Ser Arg Thr Gly Asp Val Val Glu Thr Phe He Leu 

2180 2185 2190 

His Ser Ser Ser Pro Ser Val - Arg Gin Thr Trp He His Glu He Asn 

2195 . 2200 2205 

Gin He Leu Glu Asn Gin Arg Asn Phe Leu Asn Ala Leu Thr Ser Pro 

2210 2215 2220 

He Glu Tyr Gin Arg Asn His Ser Gly Gly Gly Gly Gly Gly Gly Ser 
2225 2230 2235 2240 

Gly Ala Ala Ala Gly Val Gly Ala Ala Ala Ala Ala Gly Pro Pro Val 

2245 2250 2255 

Ala Ala Ala Ala Thr Val Ala Ala Pro Ala Ala Ala Ala Ala Pro Pro 

2260 2265 2270 

Ala Arg Ala Gly Ala Gly Pro Pro Gly Ser Pro Ser Leu Ser Asp Thr 

2275 2280 2285 

Thr Pro Pro Cys Trp Ser Pro Leu Gin Pro Arg Ala Arg Gin Arg Gin 

2290 2295 2300 

Thr Arg Cys Gin Ser Glu Ser Ser Ser Ser Ser Asn He Ser Thr Met 
2305 2310 2315 2320 

Leu Val Thr His Asp Tyr Thr Ala Val Lys Glu Asp Glu He Asn Val 
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2325 2330 2335 

Tyr Gin Gly Glu Val Val Gin He Leu Ala Ser Asn Gin Gin Asn Met 

2340 2345 2350 

Phe Leu Val Phe Arg Ala Ala Thr Asp Gin Cys Pro Ala Ala Glu Gly 

2355 2360 2365 

Trp He Pro Gly Phe Val Leu Gly His Thr Ser Ala Val lie Val Glu 

2370 2375 2380 

Asn Pro Asp Gly Thr Leu Lys Lys Ser Thr Ser Trp His Thr Ala Leu 
2385 2390 2395 2400 

Arg Leu Arg Lys Lys Ser Glu Lys Lys Asp Lys Asp Gly Lys Arg Glu 

2405 2410 2415 

Gly Lys Leu Glu Asn Gly Tyr Arg Lys Ser Arg Glu Gly Leu Ser Asn 

2420 2425 ^ 2430 

Lys Val Ser Val Lys Leu Leu Asn Pro Asn Tyr He Tyr Asp Val Pro 

2435 2440 2445 

Pro Glu Phe Val He Pro Leu Ser Glu Val Thr Cys Glu Thr Gly Glu 

2450 2455 2460 

Thr Val Val Leu Arg Cys Arg Val Cys Gly Arg Pro Lys Ala Ser He 
2465 2470 2475 2480 

Thr Trp Lys Gly Pro Glu His Asn Thr Leu Asn Asn Asp Gly His Tyr 

2485 2490 2495 

Ser He Ser Tyr Ser Asp Leu Gly Glu Ala Thr Leu Lys He Val Gly 

2500 2505 2510 

Val Thr Thr Glu Asp Asp Gly lie Tyr Thr Cys He Ala Val Asn Asp 

2515 2520 2525 

Met Gly Ser Ala Ser Ser Ser Ala Ser Leu Arg Val Leu Gly Pro Gly 

2530 2535 2540 

Met Asp Gly He Met Val Thr Trp Lys Asp Asn Phe Asp Ser Phe Tyr 
2545 2550 2555 ^ 2560 

Ser Glu Val Ala Glu Leu Gly Arg Gly Arg Phe Ser Val Val Lys Lys 

2565 2570 2575 

Cys Asp Gin Lys Gly Thr Lys Arg Ala Val Ala Thr Lys Phe Val Asn 

2580 2585 2590 

Lys Lys Leu Met Lys Arg Asp Gin Val Thr His Glu Leu Gly He Leu 

2595 2600 2605 

Gin Ser Leu Gin His Pro Leu Leu Val Gly Leu Leu Asp Thr Phe Glu 

2610 2615 2620 

Thr Pro Thr Ser Tyr He Leu Val Leu Glu Met Ala Asp Gin Gly Arg 
2625 2630 2635 2640 

Leu Leu Asp Cys Val Val Arg Trp Gly Ser Leu Thr Glu Gly Lys He 

2645 2650 2655 

Arg Ala His Leu Gly Glu Val Leu Glu Ala Val Arg Tyr Leu His Asn 

2660 . 2665 2670 

Cys Arg He Ala His Leu Asp Leu Lys Pro Glu Asn He Leu Val Asp 

2675 2680 2685 

Glu Ser Leu Ala Lys Pro Thr He Lys Leu Ala Asp Phe Gly Asp Ala 

2690 2695 ' 2700 

Val Gin Leu Asn Thr Thr Tyr Tyr He His Gin Leu Leu Gly Asn Pro 
2705 2710 2715 2720 

Glu Phe Ala Ala Pro Glu He He Leu Gly Asn Pro Val Ser Leu Thr 

2725 2730 2735 

Ser Asp Thr Trp Ser Val Gly Val Leu Thr Tyr Val Leu Leu Ser Gly 
2740 2745 2750 
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./ 

Val Ser Pro Phe Leu Asp Asp Ser Val Glu Glu Thr Cys Leu Asn lie 

2755 2760 2765 

Cys Arg Leu Asp Phe Ser Phe Pro Asp Asp Tyr Phe Lys Gly Val Ser 

2770 2775 2780 

Gin Lys Ala Lys Glu Phe Val Cys Phe Leu Leu Gin Glu Asp Pro Ala 
2785 2790 2795 2800 

Lys Arg Pro Ser Ala Ala Leu Ala Leu Gin Glu Gin Trp Leu Gin Ala 

2805 2810 2815 

Gly Asn Gly Arg Ser Thr Gly Val Leu Asp Thr Ser Arg Leu Thr Ser 

2820 2825 2830 

Phe lie Glu Arg Arg Lys His Gin Asn Asp Val Arg Pro lie Arg Ser 

2835 2840 2845 

lie Lys Asn Phe Leu Gin Ser Arg Leu Leu Pro Arg Val 
2850 2855 2860 

<210> 109 
<211> 271 
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Met Arg Lys Arg Gly Ser Val Arg Gly Thr Ser Ala Ala Asp Val 
260 265 270 
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This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 



1. Claims: Invention 1: Claims 1-50 and 53-69 (all 
completely) and claims 51, 74, 75, 
78 and 91 (all partially) 



The HBM polynucleotide and HBM polypeptide variant of the 
polymorphic Zmaxl gene, said polynucleotide and polypeptide 
of SeqIdNo.2 and SeqIdNo.4 respectively, said polynucleotide 
comprising at least 15 contiguous nucleotides of SeqIdNo.2 
wherein one of the at least 15 contiguous nucleotides is 
thymine at position 582; applications thereof. 



2. Claims: Inventions 2 to 25: Claims 51, 52, 70, 
72-74 (all partially) 

A polymorphic variant of the Zmax 1 gene, wherein invention 
2 is limited to SeqIdNo.9 wherein nucleotide 69169 is 
replaced by A, 

invention 3 to SeqIdNo.9 wherein nucleotide 27402 is 
replaced by G, 

invention 4 to SeqIdNo.9 wherein nucleotide 27841 is 
replaced by C, 

invention 5 to SeqIdNo.9 wherein nucleotide 35600 is 
replaced by 6, 

invention 6 to SeqIdNo.9 wherein nucleotide 45619 is 
replaced by A, 

invention 7 to SeqIdNo.9 wherein nucleotide 46018 is 
replaced by G, 

invention 8 to SeqIdNo.9 wherein nucleotide 46093 is 
replaced by G, 

invention 9 to SeqIdNo.9 wherein nucleotide 46190 is 
replaced by G, 

invention 10 to SeqIdNo.9 wherein nucleotide 50993 is 
replaced by C, 

invention 11 to SeqIdNo.9 wherein nucleotide 51124 is 
replaced by T, 

invention 12 to SeqIdNo.9 wherein nucleotide 55461 is 
replaced by T, 

invention 13 to SeqIdNo.9 wherein nucleotide 63645 is 
replaced by A, 

invention 14 to SeqIdNo.9 wherein nucleotide 63646 is 
replaced by C, 

invention 15 to SeqIdNo.9 wherein nucleotide 24809 is 
replaced by G, 

invention 16 to SeqIdNo.9 wherein nucleotide 27837 is 
replaced by C, 

invention 17 to SeqIdNo.9 wherein nucleotide 31485 is 
replaced by T, 

invention 18 to SeqIdNo.9 wherein nucleotide 31683 is 
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replaced by 6, 

invention 19 to SeqIdNo.9 wherein nucleotide 24808 is 
replaced by G, 

invention 20 to SeqIdNo.8 wherein nucleotide 31340 is 
replaced by C, 

invention 21 to SeqIdNo.8 wherein nucleotide 32538 is 
replaced by G, 

invention 22 to SeqIdNo.8 wherein nucleotide 13224 is 
replaced by G, 

invention 23 to SeqIdNo.8 wherein nucleotide 30497 is 
replaced by A, 

invention 24 to SeqIdNo.9 wherein nucleotide 24811 is 
replaced by C, 

invention 25 to SeqIdNo.9 wherein nucleotide 68280 is 
replaced by A. 



3. Claims: Invention 26: Claim 71 (completely) and claims 51, 
52, 70, 72-74 (all partially) 



As for invention 2 but limited to SeqIdNo.8 wherein 
nucleotide 21119 is replaced by A. 



4. Claims: Inventions 27-50: claims 75-93 (all partially. 



as applicable) 



A molecule involved in bone modulation that is, binds to or 
inhibits binding of a molecule to a protein involved in 
focal adhesion signaling, and applications thereof, wherein 
invention 27 is limited to a molecule that is, binds to or 
inhibits binding of a molecule to the protein of SeqIdNo.87 
or the corresponding nucleic acid of SeqIdNo.63, 
invention 28 to the protein of SeqIdNo.88 or the 
corresponding nucleic acid of SeqIdNo.64, 
invention 29 to the protein of SeqIdNo.89 or the 
corresponding nucleic acid of SeqIdNo.65, 
invention 30 to the protein of SeqIdNo.90 or the 
corresponding nucleic acid of SeqIdNo.66, 
invention 31 to the protein of SeqIdNo.91 or the 
corresponding nucleic acid of SeqIdNo.67, 
invention 32 to the protein of SeqIdNo.92 or the 
corresponding nucleic acid of SeqIdNo.68, 
invention 33 to the protein of SeqIdNo.93 or the 
corresponding nucleic acid of SeqIdNo.69, 
invention 34 to the nucleic acid of SeqIdNo.70, 
invention 35 to the protein of SeqIdNo.94 or the 
corresponding nucleic acid of SeqldNo.71, 
invention 36 to the protein of SeqIdNo.95 or the 
corresponding nucleic acid of SeqIdNo.72, 
invention 37 to the protein of SeqIdNo.96 or the 
corresponding nucleic acid of SeqIdNo.73, 
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invention 38 to the protein of SeqIdNo.97 or the 
corresponding nucleic acid of SeqldNo. 74, 
invention 39 to the protein of SeqldNo. 98 or the 
corresponding nucleic acid of SeqldNo. 75, 
invention 40 to the protein of SeqldNo. 99 or the 
corresponding nucleic acid of SeqldNo. 76, 
invention 41 to the protein of SeqldNo. 100 or the 
corresponding nucleic acid of SeqldNo. 77, 
invention 42 to the protein of SeqldNo. 101 or the 
corresponding nucleic acid of SeqldNo. 78, 
invention 43 to the protein of SeqldNo. 102 or the 
corresponding nucleic acid of SeqldNo. 79, 
invention 44 to the protein of SeqldNo. 103 or the 
corresponding nucleic acid of SeqldNo. 80, 
invention 45 to the protein of SeqldNo. 104 or the 
corresponding nucleic acid of SeqldNo. 81, 
invention 46 to the protein of SeqldNo. 105 or the 
corresponding nucleic acid of SeqldNo. 82, 
invention 47 to the protein of SeqldNo. 106 or the 
corresponding nucleic acid of SeqldNo. 83, 
invention 48 to the protein of SeqldNo. 107 or the 
corresponding nucleic acid of SeqldNo. 84, 
invention 49 to the protein of SeqldNo. 108 or the 
corresponding nucleic acid of SeqldNo. 85, 
invention 50 to the protein of SeqldNo. 109 or the 
corresponding nucleic acid of SeqldNo. 86. 
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Continuation of Box 1.2 
Claims Nos.: 43,44 



Present claims 43 and 44 relate to a compound defined by reference to a 
desirable characteristic or property, namely that it binds to the nucleic 
acid sequence of claim 1. The claims cover all compounds having this 
characteristic or property, whereas the application provides support 
within the meaning of Article 6 PCT and/or disclosure within the meaning 
of Article 5 PCT for no such compound* In the present case, the claims so 
lack support, and the application so lacks disclosure, that a meaningful 
search over the whole of the claimed scope is impossible. Independent of 
the above reasoning, the claims also lack clarity (Article 6 PCT). An 
attempt is made to define the compounds by reference to a result to be 
achieved. Again, this lack of clarity in the present case is such as to 
render a meaningful search over the whole of the claimed scope 
impossible. Consequently, no search can be carried out for such 
speculative claims, the wording of which is a mere recitation of the 
results to be achieved. 

The applicant's attention is drawn to the fact that claims, or parts of 
claims, relating to inventions in respect of which no international 
search report has been established need not be the subject of an 
international preliminary examination (Rule 66.1(e) PCT). The applicant 
is advised that the EPO policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. This is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 
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